EDITORIAL


Reform and cooperation in China 


Since it was established as a science funding
organization in 1986, the National Natural Sci- 
ence Foundation of China (NSFC) has seen its 
annual budget grow from CNY 80 million to 
28.04 billion in 2018, as China underwent rap- 
id transformation and became a global player 
in research. Now under the new leadership of 
President Li Jinghai, the NSFC has engaged in con- 
sultations about its future. To promote international 
dialogue, the NSFC convened representatives from 15 
funding agencies around Europe to solicit feedback on 
proposed reforms. This September meeting in Paris 
marked a first major step by the 
NSFC to align its new strategies 
and policies with those of interna- 
tional partners and demonstrate 
its commitment to cooperation. 

In the course of China’s recent 
government reorganization, the 
NSFC was affiliated with the Min- 
istry of Science and Technology, 
which raised concerns about the 
NSFC’s independence. There has 
been speculation about possible 
changes since the restructuring 
was announced, but little detail 
was provided until the outline 
of the reform was unveiled in Paris. In that context, 
the message was that the NSFC will keep its due 
independence. 

Reforms carried out by the NSFC reflect shifts in 
science: changing global science landscapes, the im- 
portance of transdisciplinarity, the combination of ap- 
plied and basic research, and the interplay between 
research and innovation. To adapt to such future 
needs, the NSFC identifies four funding categories: 
(i) curiosity-driven disruptive research, (ii) burning 
problems at the frontiers of research, (iii) excellent sci- 
ence applied to economic and social demands, and (iv) 
transdisciplinary research dealing with grand chal- 
lenges. The NSFC is considering a special division with 
tasks cutting across disciplinary divisions. All funding 
agencies pointed to the challenge of keeping “scientific 
excellence” as a dominant criterion. 

Each of the proposed funding categories will require 
specific evaluation approaches. The NSFC is reviewing
its evaluation mechanisms following principles of
responsibility, credit, and contribution of reviewers— 
avoiding nonacademic biases and conflicts of interest; 
assessing reviewers’ performance; and contributing 
constructive feedback to proposers. Evaluating trans- 
disciplinary proposals and finding reviewers with the 
necessary skills is challenging. The NSFC plans to test 
artificial intelligence-assisted approaches for identify- 
ing reviewers. 

International collaboration with foreign funders 
is a priority of the NSFC. Regularly, partners are in- 
volved in joint calls for proposals for research projects, 
exchange initiatives, and bilateral 
or multilateral workshops. For 
international joint peer reviews, 
rules are commonly agreed on 
and implemented successfully. 
But strict and differing national 
rules and some inflexibility of in- 
ternal procedures—for example, 
regarding eligibility of propos- 
ers—are hindering alignment 
and simplification. Participants 
agreed that this is an area in need 
of future joint considerations. 

Following the trend from bilat- 
eral to multilateral collaboration, 
earlier this year the NSFC and the Joint Programming 
Initiative Urban Europe launched a pilot joint call on 
sustainable urbanization involving nine European 
funding agencies. Upcoming evaluation of this effort 
will provide guidance for future multilateral initiatives 
between China and Europe. 

Contacted after the workshop, President Qiu Yong of 
Tsinghua University commended the reform as “criti- 
cally important” for science in China, adding “I will 
give my full support to the reform as a scientist.” For 
the European agencies, the NSFC’s reform provided 
inspiration for their own future development as well 
as for their cooperation with the NSFC. Cooperation 
and exchange between funding agencies have huge 
potential to strengthen international science. Open 
spaces as provided by the NSFC in Paris are needed 
for jointly shaping funding and performing science in 
global settings. 

-Manfred Horvat


“We are not anti-gun. We are anti-bullet hole. ...
Join us, or move over! This is our lane.”


Open letter signed by more than 30,000 clinicians after the National Rifle 
Association tweeted that doctors should “stay in their lane” on gun violence. 


IN BRIEF
Edited by Lila Guterman 


DISASTERS 


Trump’s tweet ignites clash over fires 


Wildfires have struck semiurban areas of California, including this part of the San Fernando Valley. 


As California struggles with some of the worst wildfires in its
history, fire experts are panning U.S. President Donald Trump’s 
comments about the causes. At least 44 people had died, and 
some 200 were missing, in three fires as Science went to press. 
In a 10 November tweet, Trump blamed the state government 
for “gross mismanagement of the forests.” Many researchers 
were quick to point out that federal officials manage two-thirds of 
California’s forests, and that the blazes mainly struck semiurbanized 
areas and shrublands. “The most destructive and deadly fires in CA 
are NOT in forests. This is grossly irresponsible and uninformed,”
tweeted wildfire scientist Crystal Kolden of the University of Idaho 
in Moscow. Others highlighted the role of climate change in creating 
the hot and dry conditions that favored fire. Ironically, Trump’s com- 
ments came just weeks after a group of prominent fire scientists, the 
Fire Research Consensus working group, issued a report concluding 
that climate change is “a strong driver of fire occurrence,” and that 
climate and weather are the “primary drivers of fire size.”


Europe studies hormone mimics


POLICY | The European Commission has
pledged to speed up research on endocrine
disruptors under its next science funding
program, beginning in 2021. This pledge is
part of a broader plan published on
7 November to protect human health,
animals, and the environment from chemicals
that alter the body’s balance of hormones. 
Since its previous strategy, in 1999, the 
commission has spent about €150 million 
on such research, and allocated another 
€52 million to studying testing methods. 
New research would focus on studying 
whether a “safe threshold” can be estab- 
lished, the “cocktail effect” of exposure to 
multiple disruptors, and the development 
of safer alternatives. The Endocrine Society 
says more research is welcome, but argues 
that completed research warrants stricter 
limits to exposure to these chemicals, found 
in products such as pesticides and plastics. 


A reprieve for tigers and rhinos


WILDLIFE CONSERVATION | After saying 
last month that it would allow the use of 
tiger and rhinoceros bones and tissues “in 
medical research and healing,” the Chinese 
government has backpedaled. In a move 
cheered by conservationists, China re- 
instated its 1993 ban—at least temporarily. 


Uganda deploys Ebola vaccine 


PUBLIC HEALTH | The persistent Ebola 
outbreak underway in the conflict-ridden 
northeastern region of the Democratic 
Republic of the Congo (DRC) has led 
neighboring Uganda to start to vaccinate 
frontline health care workers. This is the 
first time the experimental Ebola vaccine 
is being used in a country without an out- 
break. More than 2000 Ugandan health 
workers who live near the border will 
receive the vaccine, which worked well in 
a trial in Guinea in 2015, during the mas- 
sive West African epidemic. Because the 
vaccine is unlicensed, Uganda’s Ministry 
of Health had to approve its use. The DRC 
has vaccinated more than 29,000 people 
despite an armed insurgency that has 
repeatedly disrupted its work. Since
Ebola surfaced in the DRC in August,
more than 300 people have become ill,
nearly two-thirds of whom have died.


WILDLIFE CONSERVATION


Amazon turtles bounce back


The threatened giant South American
turtle (Podocnemis expansa) has made
a robust recovery on river beaches
in the Brazilian Amazon, thanks to
round-the-clock protection during its
breeding season. There has been a more
than ninefold increase in turtle hatchlings
on beaches along a 1500-kilometer section
of one of the Amazon River's tributaries
since the community conservation effort
began in 1977, a team led by researchers
from the University of East Anglia in
Norwich, U.K., reports this week in Nature
Sustainability. The result was equivalent to
adding 70,000 turtle hatchlings each year.
Poachers seeking to illegally harvest meat
and eggs of the turtle, which can grow to a
meter, attacked just 2% of the 2000 nests
on guarded beaches, compared with 99% of
the 202 nests on unprotected beaches. The
surveillance also provided unintended—and
welcome—boosts to the populations of
other species, including large-billed terns,
green iguanas, and black caimans.


Once-wet Mars site targeted 


MARS ROVER | Researchers have picked 
a landing site for the European-Russian 
ExoMars 2020 rover. Called Oxia Planum, 
the equatorial site contains clay-rich 
minerals likely formed in a large body of 
water, and channels that may have been 
cut by water some 4 billion years ago, 
when Mars was wetter. ExoMars, a joint 
project of the European Space Agency 
(ESA) and Russia’s Roscosmos, is due to 
arrive in 2021. A drill, delving 2 meters 
below the surface, will extract samples 
for analysis. The mission’s site selection
working group spent 5 years narrowing
down eight candidate sites before picking
Oxia Planum. Strongly in its favor was the
lack of steep slopes and large boulders,
which might have endangered the landing.
ESA and Roscosmos will review the site
before confirming it in 2019.


Ellipses mark the chosen landing zone for ExoMars at Oxia Planum.


SCIENCE sciencemag.org


Nearby star has an icy planet 


EXOPLANETS | The solar system has more 
company in the stellar neighborhood. After 
the 2016 discovery of a planet orbiting 
Proxima Centauri, the nearest star to the 
sun at 4 light-years away (Science, 26 August 
2016, p. 857), researchers have now found
a strong candidate around Barnard’s Star,
just 6 light-years from home. The new study
identified a faint wobble in the
motion of Barnard’s Star with a
233-day period—the signature of
the tug of a planet at least
3.2 times Earth’s mass. The
study, published this week in
Nature, drew on three telescopes’
observations of the star’s
wobble as well as 20 years of
archival data. The planet may
be rocky, but its small, dim star
may not shine brightly enough
for surface water to be liquid,
making the presence of
life unlikely.


Keystone pipeline on hold


ENERGY | A federal judge has, at least 
temporarily, blocked an effort by President 
Donald Trump’s administration to 
complete the Keystone XL pipeline, which 
would carry oil 1900 kilometers from 
central Canada to U.S. refineries. On
9 November, federal District Court Judge
Brian Morris of Montana ruled that
an environmental assessment by the
U.S. Department of State “fell short” of a
required “hard look” at the project’s effects
on climate change, Native American lands,
and the broader environment. No work
on the pipeline’s final section, from Canada
to Nebraska, can proceed until the depart- 
ment corrects flaws in its analysis, he 
ruled. It is the latest setback for Keystone 
XL, which had been blocked by former 
President Barack Obama at climate 
activists’ urging before Trump resurrected 
it early in his presidency. 


Farms need to fight bovine TB 


WILDLIFE MANAGEMENT | A contentious 
debate rages in the United Kingdom over 
whether badgers must be killed in order 
to slow the spread of tuberculosis (TB) 
in cattle, a disease that costs farmers and 
taxpayers about £120 million a year.
A new review of the issue, released on
12 November, finds badgers partly to blame
and says culling has a reasonable chance
of helping stop disease spread. But “it is
wrong to put all the blame on wildlife,” said
population biologist Charles Godfray of the 
University of Oxford in the United Kingdom, 
an author of the review. Far more cases of 
TB result from transmission between cattle 
than from badgers, the review notes, so it 
urges the government and farmers to do 
more to control bovine TB on farms. For 
example, cattle already must be tested for 
TB before they are moved from high-risk 
areas, but the review suggests switching to 
a test with fewer false negatives. 


Open-access plan draws outcry 


PUBLISHING | Scientists are pushing back 
against Plan S, the scheme to end schol- 
arly journals’ paywalls, launched 2 months 
ago by 11 national research funders in 
Europe. In an open letter published on
5 November, more than 1000 signatories
say they support open access—making
papers available free to all online—but
condemn Plan S as “too risky for science.”
The letter slams the plan’s proposed
crackdown on hybrid journals, saying
it would restrict access to high-quality,
rigorous journals published by scientific
societies. Hybrid publications earn revenue
from both reader subscription
fees and article processing charges paid
by authors who want to make their
papers immediately accessible. The letter
also warns that Plan S would endanger
collaborations between grantees of
Plan S funders and scientists still allowed
to publish in paywalled journals.


Measles hits New York City


PUBLIC HEALTH | Measles-infected travelers
returning from Israel have caused
two outbreaks of the highly contagious
disease in Orthodox Jewish communities
in and near New York City. Between late
September and 13 November, 92 cases
were reported. Public health authorities
report that suburban Rockland County has
had 68 cases of measles in unvaccinated
or undervaccinated children, teens, and
adults. In Brooklyn, 24 people, at least
17 of them aged 7 months to 4 years old,
had confirmed cases as of 13 November.
Israel is experiencing a large measles
outbreak, with 1401 cases this year through
6 November, 735 of them confirmed in
October. The European Union recorded
13,453 cases of measles and 37 deaths in
the 12 months that ended on 30 September.
In 2017, there were 120 cases in the United
States. This year there have been 220 cases
through 3 November.


MUSEUMS


Berlin museum flush with funding


Berlin’s Natural History Museum received word of a windfall last week: €660 million
over 10 years from local and federal governments. The money, among the largest
sums ever pledged to a natural history museum, will allow it to restore buildings
still damaged from World War II, preserve and digitize collections, and build a new
research campus. The research campus will focus on biodiversity, environmental
science, and science communication. Museum Director General Johannes Vogel had
lamented the state of the Berlin buildings and collections in September, after the
devastating fire at Brazil's National Museum in Rio de Janeiro. He now says he and his staff are
“over the moon” at the news, announced on 6 November.


730 16 NOVEMBER 2018 + VOL 362 ISSUE 6416


Published by AAAS


NIH’s racial gap partly explained 


FUNDING DISPARITIES | Publication his- 
tory explains roughly one-quarter of the 
gap in success rates between black and 
white researchers who apply for National 
Institutes of Health (NIH) funding, accord- 
ing to a study published this week in PLOS 
ONE. A team led by Donna Ginther, an 
economist at the University of Kansas
in Lawrence, added detailed publication
information to data in a 2011 Science paper
by Ginther’s team, which found that black
applicants’ chances of winning an R01
grant were 13 percentage points lower than
white applicants’. Using 2397 applications
submitted to NIH between 2002 and 2006, 
the team now finds that the gap narrows if 
they account for the quality of publications, 
using metrics such as journal impact factor 
and fraction of first-authored publications. 
The results suggest “that the role of bias is 
probably smaller” than was feared after the 
2011 results came out, Ginther says. 


Clinical trial disappoints 


RARE DISEASES | A molecule hailed as a 
possible treatment for Niemann-Pick type 
C, an extremely rare and ultimately fatal 
neurodegenerative disease, performed no 
differently from placebo in a pivotal trial 
in 56 children and youths, Mallinckrodt 
Pharmaceuticals, based in Staines-upon- 
Thames, U.K., announced last week. 
Perplexingly, the disease did not progress 
in either the placebo group or in patients 
treated with the drug, VTS-270, during the 
1-year study. Some researchers say such 
traditional double-blind, randomized con- 
trolled trials are likely inappropriate for 
extremely rare conditions. VTS-270,
a sugar known as a cyclodextrin, was seen
as one of the most promising collaborations
between industry and the National
Center for Advancing Translational
Sciences in Bethesda, Maryland, part
of the National Institutes of Health
(Science, 7 October 2016, p. 18).


U.S. ELECTIONS


Kim Schrier, a pediatrician,
played up her technical
training in a winning
congressional campaign.


Vote heralds fresh start for science panel 


Democratic win of House of Representatives will bring new tone to research issues 


By Jeffrey Mervis 


The results of last week’s divisive midterm
elections, with Democrats reclaiming
control of the U.S. House
of Representatives and Republicans
likely strengthening their hold on the
Senate, have allowed both parties to
claim victory. U.S. scientists are also experi- 
encing mixed emotions. 

Many are pleased with what they expect 
to be a more data-driven approach to sci- 
ence policy under the new Democratic 
chair of the House science committee. But 
they also face the sobering reality that, by 
Science’s count, only seven of the 49 House 
candidates with technical backgrounds 
were victorious. And environmental ac- 
tivists are chagrined by the defeat of a 
proposed tax on carbon emissions in Wash- 
ington and an Arizona initiative to increase 
that state’s reliance on renewable energy, 
although Nevada voters took a first step to- 
ward adopting a similar policy. 

In the House, Democrats picked up nearly 
40 seats. That outcome gives them control 
of the 435-seat body for the first time since 
2010, meaning they will appoint committee 
chairs and decide which bills get a vote. 

Representative Eddie Bernice Johnson
(D-TX) is in line to replace the retiring
Representative Lamar Smith (R-TX) as chair of
the science committee. The two Texans represent
a stark contrast. Trained as a psychiatric
nurse, Johnson has promised to “restore the 
credibility” of a committee that for 6 years 
has challenged the findings of climate scien- 
tists and questioned the need for many envi- 
ronmental regulations. 

“We were not really following our charter 
[under Smith],” says Johnson, who joined the 
panel as a new legislator in 1993 and for the 
past 8 years has been its top Democrat. Instead,
she says, “We were trying to uncover
any information that would undercut scientific
findings and avoid facing what the scientific
data were showing us.”


The new STEM Democrats


Seven candidates with science backgrounds won seats
last week in the U.S. House of Representatives.


TRAINING              NAME               STATE
Biochemical engineer  Sean Casten        Illinois
Ocean engineer        Joe Cunningham     South Carolina
Industrial engineer   Chrissy Houlahan   Pennsylvania
Nuclear engineer      Elaine Luria       Virginia
Pediatrician          Kim Schrier        Washington
Nurse                 Lauren Underwood   Illinois
Dentist               Jeff Van Drew      New Jersey

Smith, a lawyer who came to Congress in 
1989, regularly convened hearings designed 
to highlight the views of those opposed to 
federal action to curb greenhouse gas emis- 
sions. He also used his unilateral power to 
issue investigative subpoenas—an authority 
traditionally given to just a few committee 
heads—to attack climate science he found 
suspect. Johnson hopes to shift the debate 
from “ignoring what’s happening” to discuss- 
ing “what we should be doing to save our 
planet and the lives and money it takes to 
clean up after weather-related disasters.” 

That move and other changes in tone could 
help repair a breach between the panel and 
the scientific community. “Stakeholders have 
told me they stopped asking for meetings 
[with the Republican majority] because they 
didn’t see the point,” says one Democratic
staffer. “That’s going to change, because we 
will be listening.” 

All seven winners with technical back- 
grounds are Democrats, and six were first- 
time candidates. Two toppled Republican 
incumbents; the rest won open seats. Four 
are women—a pediatrician, a nurse, an in- 
dustrial engineer, and a retired U.S. Navy 
commander—helping boost overall female 
representation in the House to nearly 25%.

Newly elected lawmakers rarely get appointed
to the appropriations committee
and other panels with influence over key 
sectors of the economy, such as tax and fis- 
cal policy. Accordingly, they are often over- 
represented on the science committee. But 
none of the soon-to-be House members 
with technical backgrounds is lobbying for 
a spot on the science panel. 

Representative Eddie Bernice Johnson (D-TX) is in line to lead the House of Representatives science committee.


“I don’t know enough at this point about
what the science committee does to have
an opinion,” says Representative-elect Sean
Casten (D-IL), a biochemical engineer who
founded a company that helps firms become
more energy efficient and who defeated
Representative Peter Roskam (R) in a
suburban Chicago district. “While I worked
in basic science for half a dozen years in my
youth, I feel more confident in my ability
to deploy and apply basic science than to
create it. So committees that deal with
infrastructure and financial services, energy,
and environmental policy are closer to areas
where I can apply my skills.”

Representative-elect Lauren Underwood
(D), who ousted Representative Randy
Hultgren (R) in a north-central Illinois district,
hopes to apply her background as a
nurse and health care analyst to win a seat on
one of two panels that oversee federal health
care policy. That’s also true for Representative-elect
Kim Schrier (D-WA), a pediatrician who
won an open seat outside of Seattle.

“Health care is where people are really
hurting now,” Schrier says. “I felt I could really
lend my expertise to finding better ways
of providing it that bring costs down and
improve outcomes. ... I’m also really excited
to be the only woman doctor in Congress
at a time when women’s reproductive rights
are being attacked.”

Representative-elect Chrissy Houlahan
(D-PA), who won an open seat in the
Philadelphia suburbs, says her training as
an industrial engineer is just one of many
facets of her identity. “I’m a veteran, an
entrepreneur, a mom, and an educator as
well,” says Houlahan, who helped her husband
grow a sports apparel company and
briefly taught high school chemistry before
leading a foundation that promotes early
literacy. “I feel that I am part of a wave of
people elected who provide diversity on a
lot of levels.”

Climate change is an existential issue
for two new members representing coastal
districts. In South Carolina, Representative-elect
Joe Cunningham (D), an ocean engineer
turned environmental lawyer, hammered
his opponent for voicing support of
President Donald Trump’s plan to lift a ban
on offshore drilling along the Atlantic coast,
a pivotal issue for his constituents.
Representative-elect Elaine Luria (D-VA) 
says her 20-year career in the Navy helps 
her understand both the civilian and mili- 
tary components of sea-level rise. And she 
thinks the public is already on board. “Peo- 
ple see our roads flooding and the sea level 
rising,” she says about her southeastern Vir- 
ginia district. “I have yet to talk to anyone 
who doesn’t think climate change is real.” 
Before these new members can take their 
seats in January 2019, the current class of 
legislators must finish work on a spending 
bill for the 2019 fiscal year that began on 
1 October. An earlier agreement to increase 
overall spending in 2018 and 2019 allowed 
Congress to pass budgets for about two- 
thirds of the government, including the 
National Institutes of Health (NIH) and the 
Department of Energy. But budgets for the
remaining agencies, including NASA, the
National Science Foundation (NSF), and 
several science agencies within the Depart- 
ment of Commerce, have been frozen un- 
der a continuing resolution that expires on 
7 December. Disagreement over Trump’s 
request to build a wall between the United 
States and Mexico stands in the way of a fi- 
nal deal by the lame-duck Congress. 

The annual battle over spending could 
intensify next year. The divided Congress 
will have to deal with a 2011 law aimed 
at reducing the federal deficit over a de- 
cade. That law imposes spending caps, and 
could force lawmakers to cut a combined 
$126 billion from civilian and military 
budgets unless the Democratic House and 
Republican-controlled Senate can broker a 
deal to raise the caps. 

Some legislators long associated with sci- 
ence issues won’t be around for those de- 
bates. Senator Bill Nelson (D-FL), a NASA 
enthusiast who once flew aboard the space 
shuttle, appears to have lost his bid for re- 
election. In the House, the losers included 
Representative John Culberson (R-TX), who 
chairs a House appropriations subcommit- 
tee that sets spending levels for several sci- 
ence agencies, including NASA and NSF, and 
has pushed for NASA to develop a mission 
to Jupiter’s moon Europa. The science com- 
mittee will lose Hultgren, a cheerleader for 
basic energy research, as well as Represen- 
tative Barbara Comstock (R-VA), who chairs 
the research subcommittee, and Representa- 
tive Dana Rohrabacher (R-CA), a persistent 
doubter of climate science. 

The frontrunner to take Culberson’s spend- 
ing gavel is Representative José Serrano (D- 
NY), an advocate for science with a special 
interest in the Census Bureau. Representative 
Nita Lowey (D-NY) is poised to lead the full ap- 
propriations committee, and Representative 
Rosa DeLauro (D-CT) is the favorite to lead 
the subcommittee that oversees NIH. Both 
have been supportive of federal investments 
in research. 

Meanwhile, some science candidates who 
didn’t win last week see a silver lining. Randy 
Wadkins, a professor of biochemistry at the 
University of Mississippi in Oxford, was the 
only academic scientist to make it onto the 
general election ballot. And although he lost 
by a two-to-one margin to an incumbent Re- 
publican, he says his campaign “might have 
been the most important thing I’ve ever 
done in my life, science-wise.” 

Seeking a House seat gave him a platform 
to connect with people “who were interested 
in science and wanted to do something,” he 
says. “A lot of us lost. But some of us won. 
And that’s my take-home message: This isn’t 
the end of scientists running for Congress; 
it’s just the beginning.”


Eruption made 536 ‘the worst year to be alive’


Core from glacier reveals the Icelandic volcano that plunged Europe into darkness 


By Ann Gibbons 


Ask medieval historian Michael
McCormick what year was the worst
to be alive, and he’s got an answer:
“536.” Not 1349, when the Black
Death wiped out half of Europe. Not
1918, when the flu killed 50 million to
100 million people, mostly young adults.
But 536. In Europe, “It was the beginning
of one of the worst periods to be alive, if not
the worst year,” says McCormick, a historian
and archaeologist who chairs the Harvard 
University Initiative for the Science of the 
Human Past. 

A mysterious fog plunged Europe, the 
Middle East, and parts of Asia into darkness, 
day and night—for 18 months. “For the sun 
gave forth its light without brightness, like 
the moon, during the whole year,” wrote Byzantine
historian Procopius. Temperatures in
the summer of 536 fell 1.5°C to 2.5°C, initiat- 
ing the coldest decade in the past 2300 years. 
Snow fell that summer in China; crops failed; 
people starved. The Irish chronicles record 
“a failure of bread from the years 536-539.” 
Then, in 541, bubonic plague struck the Ro- 
man port of Pelusium, in Egypt. What came 
to be called the Plague of Justinian spread 
rapidly, wiping out one-third to one-half of 
the population of the eastern Roman Empire 
and hastening its collapse, McCormick says. 

Historians have long known that the mid- 
dle of the sixth century was a dark hour in 
what used to be called the Dark Ages, but
the source of the mysterious clouds has long
been a puzzle. Now, an ultraprecise analysis
of ice from a Swiss glacier by a team led by
McCormick and glaciologist Paul Mayewski
at the Climate Change Institute of The University
of Maine (UM) in Orono has fingered
a culprit. At a workshop at Harvard this week,
the team reported that a cataclysmic volcanic
eruption in Iceland spewed ash across
the Northern Hemisphere early in 536. Two
other massive eruptions followed, in 540 and
547. The repeated blows, followed by plague,
plunged Europe into economic stagnation
that lasted until 640, when another signal in
the ice—a spike in airborne lead—marks a
resurgence of silver mining, as the team reports
in Antiquity this week.


Slivers from a Swiss ice core held chemical clues to
natural and humanmade events.


To Kyle Harper, provost and a medieval 
and Roman historian at The University of 
Oklahoma in Norman, the detailed log of 
natural disasters and human pollution frozen 
into the ice “give us a new kind of record for 
understanding the concatenation of human 
and natural causes that led to the fall of the 
Roman Empire—and the earliest stirrings of 
this new medieval economy.” 

Ever since tree ring studies in the 1990s 
suggested the summers around the year 540 
were unusually cold, researchers have hunted 
for the cause. Three years ago polar ice cores 
from Greenland and Antarctica yielded a 
clue. When a volcano erupts, it spews sulfur, 
bismuth, and other substances high into the 
atmosphere, where they form an aerosol veil 
that reflects the sun’s light back into space, 
cooling the planet. By matching the ice re- 
cord of these chemical traces with tree ring 
records of climate, a team led by Michael Sigl,
now of the University of Bern, found that 
nearly every unusually cold summer over the 
past 2500 years was preceded by a volcanic 
eruption. A massive eruption—perhaps in 
North America, the team suggested—stood 
out in late 535 or early 536; another followed 
in 540. Sigl’s team concluded that the double 
blow explained the prolonged dark and cold. 

Darkest hours and then a dawn 

A high-resolution ice core record combined with historical texts chronicles the impact of natural disasters on European society. 

536: Icelandic volcano erupts, dimming the [...] 
536-545: Coldest decade on record in 2000 years. Crops fail [...] 
540-541: Second volcanic eruption. Summer [...] 
541-543: The “Justinian” bubonic plague spreads through the [...] 
640: After declining in the mid-500s, a surge in atmospheric lead signals an increase in silver mining because of economic recovery. 
660: A second lead peak reflects silver mining, probably at Melle, France, tied to a switch from gold to silver for coins and the beginnings of the medieval economy. 

Mayewski and his interdisciplinary team 
decided to look for the same eruptions in an 
ice core drilled in 2013 in the Colle Gnifetti 
Glacier in the Swiss Alps. The 72-meter-long 
core entombs more than 2000 years of fallout 
from volcanoes, Saharan dust storms, 
and human activities smack in the center 
of Europe. The team deciphered this record 
using a new ultra-high-resolution method, 
in which a laser carves 120-micron slivers of 
ice, representing just a few days or weeks of 
snowfall, along the length of the core. Each 
of the samples—some 50,000 from each me- 
ter of the core—is analyzed for about a dozen 
elements. The approach enabled the team to 
pinpoint storms, volcanic eruptions, and lead 
pollution down to the month or even less, go- 
ing back 2000 years, says UM volcanologist 
Andrei Kurbatov. 

In ice from the spring of 536, UM graduate 
student Laura Hartman found two micro- 
scopic particles of volcanic glass. By bom- 
barding the shards with x-rays to determine 
their chemical fingerprint, she and Kurbatov 
found that they closely matched glass par- 
ticles found earlier in lakes and peat bogs in 
Europe and in a Greenland ice core. Those 
particles in turn resembled volcanic rocks 
from Iceland. The chemical similarities con- 
vince geoscientist David Lowe of The Univer- 
sity of Waikato in Hamilton, New Zealand, 
who says the particles in the Swiss ice core 
likely came from the same Icelandic volcano. 
But Sigl says more evidence is needed to convince 
him that the eruption was in Iceland 
rather than North America. 

Either way, the winds and weather systems 
in 536 must have been just right to guide the 
eruption plume southeast across Europe and, 
later, into Asia, casting a chilly pall as the volcanic 
fog “rolled through,” Kurbatov says. The 
next step is to try to find more particles from 
this volcano in lakes in Europe and Iceland, 
in order to confirm its location in Iceland and 
tease out why it was so devastating. 

A century later, after several more erup- 
tions, the ice record signals better news: the 
lead spike in 640. Silver was smelted from 
lead ore, so the lead is a sign that the pre- 
cious metal was in demand in an economy 
rebounding from the blow a century before, 
says archaeologist Christopher Loveluck of 
the University of Nottingham in the United 
Kingdom. A second lead peak, in 660, marks 
a major infusion of silver into the emergent 
medieval economy. It suggests gold had be- 
come scarce as trade increased, forcing a 
shift to silver as the monetary standard, 
Loveluck and his colleagues write in Antiquity. 
“It shows the rise of the merchant class 
for the first time,” he says. 

Still later, the ice is a window into another 
dark period. Lead vanished from the air dur- 
ing the Black Death from 1349 to 1353, re- 
vealing an economy that had again ground 
to a halt. “We’ve entered a new era with this 
ability to integrate ultra-high-resolution 
environmental records with similarly high 
resolution historical records,” Loveluck says. 
“It’s a real game changer.” 

Obesity gives unexpected 
boost to anticancer drugs 


Drugs revive immune cells exhausted by obesity hormone 


By Jocelyn Kaiser 


Second only to smoking, obesity is a 
top risk factor for cancer. But oncologists 
have noticed something surprising: 
Overweight patients sometimes 
respond better than other 
patients to powerful drugs that harness 
the immune system to fight tumors. 
Now, researchers tracing the complex effects 
of obesity on cancer are glimpsing a possible 
explanation: Obesity weakens the immune 
system and favors tumor growth by boosting 
the very same molecules those drugs target. 
“For the most part, everybody assumes 
obesity is always bad. But [with these drugs], 
there was a net positive,” says cancer immunologist 
William Murphy of the University of California 
(UC), Davis, who, with UC Davis oncologist 
Arta [...] reported this week in Nature 
Medicine. Murphy [...] cancer patients. 

“... everybody assumes obesity is always bad. 
But [with these drugs], there was a net positive.” 
William Murphy, University of California, Davis 

Called checkpoint inhibitors, the drugs block 
the activation of PD-1, a protein on 
the surface of immune sentinels called T cells. 
The body naturally triggers 
PD-1 to dampen immune responses, but tumors 
can also stimulate PD-1 to protect themselves. 
Lifting this molecular “brake” allows 
the T cells to attack the cancer cells. PD-1 
inhibitors have caused untreatable tumors 
to vanish for years in people with melanoma, 
lung cancer, and some other cancer types. 

But only a minority of patients respond 
to the drugs, and a study this year in The 
Lancet Oncology showed that the responders 
disproportionately include people who 
are overweight. In 330 advanced melanoma 
patients given a PD-1 inhibitor, researchers 
at MD Anderson Cancer Center in Houston, 
Texas, found that those who were male and 
overweight lived much longer on average: 
nearly 27 months compared with 14 months 
for patients with a normal body mass index. 

Now, Murphy’s team has firmed up this 
clinical observation in the lab and identified 
a possible basis. After confirming that tumors 
grow faster in obese mice, his team studied 
the T cells of obese mice, monkeys, and 
people. They found that the cells were what 
immunologists call “exhausted.” They were 
slow to proliferate and had stopped making 
secreted proteins that stimulate other immune 
system helpers. They also displayed 
more PD-1 than average, meaning cancer 
cells could more easily suppress them and 
grow unhindered. 

Leptin, a hormone made by fat cells, is 
one factor in the PD-1 excess, Murphy’s 
group found. Overweight animals and people 
produce high levels of the hormone, 
which normally signals the brain that the 
body has had enough to eat. But leptin 
also affects the immune system, and the 
UC Davis team suspects it 
triggers a signaling pathway 
that increases PD-1 
on T cells. The PD-1 excess [...] 
benefit, Murphy’s team found: 
In obese mice, it makes T [...] 
brake, the T cells sprang 
back into action. Nourished 
by glucose and other 
nutrients abundant in an 
overweight animal’s tissues, 
they worked better at curbing tumors 
than in normal weight animals. 

The finding suggests an “unexpected” ben- 
efit of obesity for cancer patients, says Har- 
vard University immunologist Lydia Lynch. 
Her group reports in Nature Immuno- 
logy this week on a different way obesity im- 
pairs the immune system’s ability to attack 
tumors: by hampering a type of immune 
cell called natural killer cells that seek out 
and destroy abnormal cells. 

Murphy plans to explore whether briefly 
giving normal weight mice with cancer 
a high-fat diet in order to mimic some ef- 
fects of obesity could boost their response 
to PD-1 inhibitors. But such treatments for 
cancer patients could have harmful effects, 
cautions tumor immunologist Suzanne 
Ostrand-Rosenberg of The University of 
Utah in Salt Lake City, who also studies how 
obesity spurs tumor growth. “It’s a balance 
here, a very careful balance,” she says. 

Fresh fights roil evidence- 
based medicine group 


Expelled Cochrane co-founder Peter Gøtzsche is suspended 
as head of the Nordic Cochrane Centre 


By Gretchen Vogel 


A messy clash within Cochrane, an 
international network that promotes 
evidence-based medicine, is spiraling 
into what some see as a battle over 
the organization’s character and purpose. 
Last week, prominent pharma 
critic Peter Gøtzsche failed in a bid to sever 
the Nordic Cochrane Centre in Copenhagen, 
which he led, from the wider international 
organization. Instead, Gøtzsche was suspended 
by the Rigshospitalet in Copenhagen, 
which hosts the center. 

The move triggered protests from within 
the organization, which had expelled Gøtzsche 
2 months ago. At stake is not just Gøtzsche’s 
job or his center, but the extent to which Co- 
chrane can tolerate debate and dissent, says 
Gerd Antes of Ludwig Maximilian University 
in Munich, Germany, a former head of the 
German Cochrane Centre in Freiburg. “It’s 
more than a mess, it’s now a whole scandal,” 
Antes says. 

Cochrane publishes systematic reviews 
of the evidence for the efficacy of drugs, 
vaccines, diagnostics, medical procedures, 
and other health-related interventions. 
The organization has centers or groups in 
43 countries and a central office in London. 
Gøtzsche, a controversial figure who 
helped found Cochrane in 1993, was elected 
to its governing board in January 2017. 
Marguerite Koster, a co-chair of the board 
and a senior manager at Kaiser Permanente 
in Los Angeles, California, says there had 
been complaints against him for years, in- 
cluding several recently about his use of the 
Cochrane name in letters unrelated to the 
organization. The conflict escalated in June, 
when Gøtzsche and two co-authors published 
a scathing attack on a Cochrane review about 
the efficacy of human papillomavirus vac- 
cines. (They said the review “was incomplete 
and ignored important evidence of bias.”) 

Gøtzsche’s expulsion in September for 
what the board called “a consistent pattern of 
disruptive and inappropriate behaviours” led 
four other board members to resign in protest. 
Gøtzsche vowed to take his center out of 
the international network, but the board said 
he had no right to do so. With his suspen- 
sion from the Rigshospitalet, it appears the 
board has won—at least for now. A statement 
by the hospital’s deputy chief executive, Per 
Jørgensen, said it is “striving to ensure that 
the Nordic Cochrane Centre continues as part 
of the international Cochrane collaboration.” 
“The only reason that they gave was that they 
no longer had confidence in my leadership,” 
says Gøtzsche, who is working with a lawyer 
to challenge the suspension. 

Peter Gøtzsche’s clashes with Cochrane have 
sparked calls for policy changes. 

A 4 November petition triggered by rumors 
that Gøtzsche was about to be let 
go asks Danish Minister of Health Ellen 
Trane Nørby to “reconsider this possible 
dismissal.” It argues that Gøtzsche’s work 
has “played a pivotal role in favor of the 
transparency of clinical data, the prior- 
ity of public health needs and the defense 
of rigorous medical research carried out 
independently of conflicts of interest.” As 
Science went to press, more than 6000 
people had signed the petition, including 
The BMJ Editor-in-Chief Fiona Godlee and 
Cochrane co-founder Iain Chalmers. “This 
shows how wrong the board is,” Antes says. 

Antes and 14 other Cochrane directors 
and editors have put together a second pe- 
tition criticizing a lack of transparency at 
the organization and the board’s statement 
that there will be “zero tolerance for bad behavior” 
in the wake of Gøtzsche’s expulsion. 
“We fear this will easily lead to ‘zero tolerance 
for different opinions,’” they write. 
Gøtzsche “has made mistakes,” Antes says, 
but, “you need people like Peter Gøtzsche. 
Otherwise you get a bit too stiff.” 

The signatories also call for policy 
changes, including using revenue from 
Cochrane publications to support the pro- 
duction of reviews, rather than the group’s 
headquarters, and moving toward an open- 
access publishing model. The current busi- 
ness model, based on selling reviews as 
products, is a source of tension, says David 
Hammerstein Mintz, a former member of 
the European Parliament from Spain and 
one of the board members who resigned. 
Cochrane’s centers and groups raise their 
own funds, usually from government 
sources, he notes. “Many members don’t see 
their work as products. It’s a public good.” 

Cochrane CEO Mark Wilson says the 
board is preparing a response to the criti- 
cisms. He says the gist will be “We wel- 
come members’ input on issues like this.” 
Wilson downplays the controversies and 
says the organization is thriving. “Co- 
chrane is delivering on its strategy and is 
in a very good place.” 

Cochrane is holding new board elections 
in the coming weeks. Antes hopes candi- 
dates will support the changes he and others 
have proposed. But the conflict has already 
harmed Cochrane’s credibility, says Jos 
Verbeek, coordinating editor of a Cochrane 
review group on occupational medicine. 
“This has had a big impact,” says Verbeek, 
who’s at the Finnish Institute of Occupa- 
tional Health in Helsinki. “People are won- 
dering, ‘What is happening at Cochrane, the 
most trustworthy source of evidence?’” 


With reporting by Martin Enserink. 

Reprogrammed cells could 
tackle brain damage 


Turning astrocytes into neurons improves symptoms 
in preliminary mouse studies 


By Kelly Servick, in San Diego, California 


If a diseased or injured brain has lost 
neurons, why not ask other cells to 
change jobs and pick up the slack? Sev- 
eral research teams have taken a first 
step by “reprogramming” abundant 
nonneuronal cells called astrocytes into 
neurons in the brains of living mice. 

“Everybody is astonished, at the moment, 
that it works,” says Nicola Mattugini, a neurobiologist 
at Ludwig Maximilian University 
in Munich, Germany, who presented the 
results of one such experiment here at the 
annual meeting of the Society for Neuro- 
science last week. 

Now, labs are turning to the next ques- 
tions: Do these neurons function like the 
lost ones, and does creating neurons at the 
expense of astrocytes do brain-damaged 
animals any good? Many researchers remain 
skeptical on both counts. But Mattugini’s 
team, led by neuroscientist Magdalena Götz, 
and two other groups presented evidence at 
the meeting that reprogrammed astrocytes 
do, at least in some respects, impersonate 
the neurons they're meant to replace. The 
two other groups also shared evidence that 
reprogrammed astrocytes help mice recover 
movement lost after a stroke. 

Some see the approach as a potential 
alternative to transplanting stem cells (or 
stem cell-derived neurons) into the dam- 
aged brain or spinal cord. Clinical trials 
of that strategy are already underway for 
conditions including Parkinson’s disease 
and spinal cord injury. But Gong Chen, a 
neuroscientist at Pennsylvania State Uni- 
versity in State College, says he got disil- 
lusioned with the idea after finding in his 
rodent experiments that transplanted cells 
produced relatively few neurons, and those 
few weren’t fully functional. The recent 
discovery that mature cells can be nudged 
toward new fates pointed to a better approach, 
he says. His group and others took 
aim at the brain’s most abundant cell, the 
star-shaped astrocyte. 

Researchers have converted astrocytes (red) into 
neurons (green) in a living mouse brain. 

Astrocytes are glial cells, named for the 
misconception that they're merely the brain’s 
structure-giving “glue.” In fact, they nourish 
and communicate with neurons and help 
control blood flow. After an injury, subsets 
of astrocytes proliferate, promote inflam- 
mation, and contribute to the formation of a 
scar. Many scientists think astrocytes’ effects 
on recovery are contradictory—some helpful 
and some harmful. 

“I cannot imagine another technology to 
be more efficient than using the neighbor- 
ing glial cell” to repair the brain, Chen says. 
His group enlisted a harmless virus that, in- 
jected into the brain, infects astrocytes and 
introduces DNA that codes for NeuroD1, 
a transcription factor that activates genes 
typically expressed in neurons. The repro- 
gramming apparently prompts other astro- 
cytes to multiply, which he thinks might 
prevent the treatment from depleting the 
brain of astrocytes. 

The approach, under development in sev- 
eral labs working with various transcription 
factors, is “super provocative,” says Timothy 
Murphy, a neuroscientist at The University 
of British Columbia in Vancouver, Canada, 
who studies how brain circuits change after 
stroke. But, he adds: “These cells need to sur- 
vive, and they need to reconnect.” 

No group has yet shown that the repro- 
grammed cells do wire up into circuits to 
carry out the functions of lost neurons. But 
several have evidence that the cells take on 
key neural features. In the weeks after induc- 
ing a stroke in a mouse’s brain, Chen’s team 
saw reprogrammed astrocytes retract some 
of their starlike tendrils and begin to produce 
hallmark neural proteins. Reprogrammed as- 
trocytes also appear to fire electrical signals 
and extend new fibers across the brain and 
into the spinal cord. 

Götz’s team, meanwhile, documented that 
newly reprogrammed neurons around the 
site of a stab wound resemble pyramidal 
neurons, which send excitatory signals. (Her 
group, like others, is now teasing out how dif- 
ferent combinations of transcription factors 
prompt astrocytes to become different types 
of neurons.) The researchers also found that 
newly reprogrammed neurons express differ- 
ent markers and send out different projec- 
tions depending on which layer of the cortex 
they are in, just as native neurons do. 

That’s “very surprising,” says Chun-Li 
Zhang, a neuroscientist at the University of 
Texas Southwestern Medical Center in Dallas. 
He is exploring a different reprogramming 
process, which turns astrocytes into 
primitive neural progenitor cells that then 
become neurons more gradually. Both ap- 
proaches will have to overcome skepticism, 
he says. Many researchers don’t expect neu- 
ral newcomers, introduced abruptly into the 
adult brain, to mature and function normally. 

“To really convince people,” he says, “we 
need to be really careful” to document the 
steps in the transformation of these cells— 
and to prove that they begin as astrocytes 
and finish as mature neurons. 

Researchers have also begun to look for 
indications that the approach helps ani- 
mals heal. In a study posted in April on the 
preprint server bioRxiv, Chen’s group re- 
ported that reprogrammed cells improved 
a mouse’s ability to walk and use its front 
limbs after a stroke. At the meeting, he 
hinted that the same approach had restored 
neural tissue in the brains of stroke-injured 
monkeys; experiments to gauge their recov- 
eries are ongoing at a collaborator’s facility 
in China, he says. 

Chen has founded a company to develop 
therapies with astrocyte reprogramming, 
including a cocktail of small molecules that 
could reprogram cells without brain sur- 
gery or the use of a virus. “I believe this is 
the future,” he told the audience at his con- 
ference presentation. “It’s the next frontier 
in regenerative medicine.” 

Stem cell biologist Cindi Morshead of the 
University of Toronto (U of T) in Canada is 
more circumspect. Scientists don’t fully un- 
derstand the role of astrocytes in the brain 
after an injury, she says, but “they’re there 
for a purpose.” As her group prepared to 
test the strategy, she expected it to make in- 
jured animals worse. 

She’s more optimistic now. At the confer- 
ence, her U of T collaborator Maryam Faiz 
revealed that mice injected with NeuroD1 a 
week after a stroke recovered motor func- 
tion more quickly than untreated mice, 
some of which were permanently disabled. 
By 2 months after the treatment, mice per- 
formed about as well as healthy controls on 
walking tests. Fully 20% of their neurons 
were reprogrammed cells. 

The results in stroke are among the 
first glimmers of benefit. Last year, Swed- 
ish researchers also reported that they 
had restored some motor function in a 
mouse model of Parkinson’s disease by 
reprogramming astrocytes into dopamine- 
producing neurons. 

Morshead’s results have encouraged 
her to continue experimenting. She now 
wants to wait longer after a stroke to inject 
her mice. Once stroke disability becomes 
chronic in humans, “we have nothing for 
them,” she says. If long-disabled mice ben- 
efit from their new neurons, she says, “now, 
that would be the coolest thing.” 

Large galaxy found lurking 
on the Milky Way’s far side 


Antlia 2 rivals the Large Magellanic Cloud in size 
but is strangely dim, a puzzle for theorists 


By Adam Mann 


Circling our galaxy is a stealthy giant. 
Astronomers have discovered a dwarf 
galaxy, called Antlia 2, that is one-third 
the size of the Milky Way itself. 
As big as the Large Magellanic Cloud, 
the galaxy’s largest companion, Antlia 2 
eluded detection until now because it 
is 10,000 times fainter. Such a strange beast 
challenges models of galaxy formation and 
dark matter, the unseen stuff that helps pull 
galaxies together. 

“It’s a very odd object and kind of exciting 
because we don’t know yet how to interpret 
all of its properties,” 
says Andrey Kravtsov of The 
University of Chicago in Illinois, 
who was not involved in the work. 

“It’s ... kind of exciting because we don’t know 
yet how to interpret all of its properties.” 
Andrey Kravtsov, The University of Chicago 

The galaxy was discovered with data from the 
European Space Agency’s Gaia satellite, a space 
telescope measuring the motions 
and properties of more than 
1 billion stars in and around 
the Milky Way. Gabriel 
Torrealba, an astronomy postdoc 
at the Academia Sinica 
in Taipei, decided to sift the data for RR 
Lyrae stars. These old stars, often found in 
dwarf galaxies, shine with a throbbing blue 
light that pulses at a rate signaling their inherent 
brightness, allowing researchers to 
pin down their distance. 

“RR Lyrae are so rare at these distances 
that even if you see two, you question why 
they are together,” says Vasily Belokurov, an 
astronomer at the University of Cambridge 
in the United Kingdom and a collaborator 
on the discovery. When the team found 
three, some 420,000 light-years away, it was 
“an overwhelming signal” of a large cluster 
of stars in that location, Belokurov says. But 
because the RR Lyrae stars lie on the far side 
of the disk of the Milky Way and its obscuring 
veils of stars and gas, finding their companions 
was not easy. 

Gaia data helped the team see past the 
foreground stars. Objects in the Milky Way’s 
disk are close enough for Gaia to measure 
their parallax: a shift in their apparent position 
as Earth moves around the sun. More 
distant stars appear fixed in one spot. After 
removing the parallax-bearing stars, the researchers 
homed in on more than 100 red 
giant stars moving together in the constellation 
Antlia, they report in a paper posted to 
the preprint server arXiv this week. The giants 
mark out a sprawling companion galaxy 
100 times less massive than anything of similar 
size, with far fewer stars. 

To explain such a diffuse galaxy, Belokurov 
suggests that early in Antlia 2’s history, many 
young stars exploded as violent supernovae. 
This would have blown gas and dust out of 
the galaxy, weakening its gravity 
so that it puffed up. An 
abundance of the heavy elements 
that are strewn from 
the guts of exploding stars 
adds credibility to this idea, 
says Shea Garrison-Kimmel, 
an astrophysicist at the California 
Institute of Technology 
in Pasadena. Antlia 2 could 
also have lost matter as stars 
were tugged away by gravitational 
tidal forces as it orbited 
around the larger Milky Way. 

Even so, its large size is 
hard to explain. Galaxies are thought to have 
formed when the gravity of enormous clumps 
of dark matter drew in enough ordinary mat- 
ter to fuel the birth of stars. The team specu- 
lates that Antlia 2 might have been born from 
a fluffier, faster-moving type of dark matter 
than current models hypothesize. 

To Garrison-Kimmel, one example isn’t 
enough to say the dark matter in Antlia 2 is 
different from that in the Milky Way and its 
other satellites. “There’s nothing in this one 
galaxy that screams to me that we need to re- 
think dark matter,” he says. “But if there are a 
lot of these, then we might need to take a step 
back and ask what’s going on.” 

That could happen now that astronomers 
know how to find these big, elusive com- 
panions. “I think this object is a harbinger,” 
Kravtsov says. “A taste of things to come.” 


Adam Mann is a journalist in 
Oakland, California. 

On a bright July day 2 years 
ago, Kurt Kjær was in a helicopter 
flying over northwest 
Greenland—an expanse of ice, 
sheer white and sparkling. 
Soon, his target came into view: 
Hiawatha Glacier, a slow-moving 
sheet of ice more than a kilometer 
thick. It advances on the Arctic 
Ocean not in a straight wall, but in a con- 
spicuous semicircle, as though spilling out 
of a basin. Kjær, a geologist at the Natural 
History Museum of Denmark in Copenha- 
gen, suspected the glacier was hiding an 
explosive secret. The helicopter landed near 
the surging river that drains the glacier, 
sweeping out rocks from beneath it. Kjær 
had 18 hours to find the mineral crystals 
that would confirm his suspicions. 

What he brought home clinched the case 
for a grand discovery. Hidden beneath Hiawatha 
is a 31-kilometer-wide impact crater, 
big enough to swallow Washington, D.C., 
Kjær and 21 co-authors report this week 
in a paper in Science Advances. The cra- 
ter was left when an iron asteroid 1.5 kilo- 
meters across slammed into Earth, possi- 
bly within the past 100,000 years. 

Though not as cataclysmic as the 
dinosaur-killing Chicxulub impact, which 
carved out a 200-kilometer-wide crater 
in Mexico about 66 million years ago, the 
Hiawatha impactor, too, may have left an 
imprint on the planet’s history. The timing 
is still up for debate, but some researchers 
on the discovery team believe the asteroid 
struck at a crucial moment: roughly 13,000 
years ago, just as the world was thawing 
from the last ice age. That would mean it 
crashed into Earth when mammoths and 
other megafauna were in decline and peo- 
ple were spreading across North America. 

The impact would have been a spectacle 
for anyone within 500 kilometers. A white 
fireball four times larger and three times 
brighter than the sun would have streaked 
across the sky. If the object struck an ice 
sheet, it would have tunneled through to 
the bedrock, vaporizing water and stone 
alike in a flash. The resulting explosion 
packed the energy of 700 1-megaton nuclear 
bombs, and even an observer hundreds of 
kilometers away would have experienced a 
buffeting shock wave, a monstrous thunder- 
clap, and hurricane-force winds. Later, rock 
debris might have rained down on North 
America and Europe, and the released 
steam, a greenhouse gas, could have locally 
warmed Greenland, melting even more ice. 

The news of the impact discovery has 
reawakened an old debate among scien- 
tists who study ancient climate. A mas- 
sive impact on the ice sheet would have 
sent meltwater pouring into the Atlantic 
Ocean—potentially disrupting the conveyor 
belt of ocean currents and causing tempera- 
tures to plunge, especially in the Northern 
Hemisphere. “What would it mean for species 
or life at the time? It’s a huge open 
question,” says Jennifer Marlon, a paleoclimatologist 
at Yale University. 

A 1.5-kilometer asteroid, intact or in pieces, may have 
smashed into an ice sheet just 13,000 years ago. 

A decade ago, a small group of scientists 
proposed a similar scenario. They were try- 
ing to explain a cooling event, more than 
1000 years long, called the Younger Dryas, 
which began 12,800 years ago, as the last 
ice age was ending. Their controversial 
solution was to invoke an extraterrestrial 
agent: the impact of one or more comets. 
The researchers proposed that besides 
changing the plumbing of the North Atlan- 
tic, the impact also ignited wildfires across 
two continents that led to the extinction 
of large mammals and the disappearance 
of the mammoth-hunting Clovis people 
of North America. The research group 
marshaled suggestive but inconclusive 
evidence, and few other scientists were 
convinced. But the idea caught the public’s 
imagination despite an obvious limitation: 
No one could find an impact crater. 

Proponents of a Younger Dryas impact 
now feel vindicated. “I’d unequivocally 
predict that this crater is the same age as 
the Younger Dryas,” says James Kennett, 
a marine geologist at the University of 
California, Santa Barbara, one of the idea’s 
original boosters. 

But Jay Melosh, an impact crater expert 
at Purdue University in West Lafayette, 
Indiana, doubts the strike was so recent. 
Statistically, impacts the size of Hiawatha 
occur only every few million years, he says, 
and so the chance of one just 13,000 years 
ago is small. No matter who is right, the 
discovery will give ammunition to Younger 
Dryas impact theorists—and will turn the 
Hiawatha impactor into another type of 
projectile. “This is a hot potato,” Melosh 
tells Science. “You're aware you're going to 
set off a firestorm?” 


IT STARTED WITH a hole. In 2015, Kjær and 
a colleague were studying a new map of 
the hidden contours under Greenland’s 
ice. Based on variations in the ice’s depth 
and surface flow patterns, the map of- 
fered a coarse suggestion of the bedrock 
topography—including the hint of a hole 
under Hiawatha. 

Kjær recalled a massive iron meteorite 
in his museum’s courtyard, near where he 
parks his bicycle. Called Agpalilik, Inuit for 
“the Man,” the 20-ton rock is a fragment of 
an even larger meteorite, the Cape York, 
found in pieces on northwest Greenland 
by Western explorers but long used by In- 
uit people as a source of iron for harpoon 
tips and tools. Kjær wondered whether the 
meteorite might be a remnant of an impac- 
tor that dug the circular feature under Hi- 
awatha. But he still wasn’t confident that 
it was an impact crater. He needed to see 
it more clearly with radar, which can pen- 
etrate ice and reflect off bedrock. 

Kjær’s team began to work with Joseph 
MacGregor, a glaciologist at NASA’s God- 
dard Space Flight Center in Greenbelt, 
Maryland, who dug up archival radar data. 
MacGregor found that NASA aircraft of- 
ten flew over the site on their way to sur- 
vey Arctic sea ice, and the instruments 
were sometimes turned on, in test mode, 
on the way out. “That was pretty glorious,” 
MacGregor says. 

The radar pictures more clearly showed 
what looked like the rim of a crater, but 
they were still too fuzzy in the middle. 
Many features on Earth’s surface, such as 
volcanic calderas, can masquerade as cir- 
cles. But only impact craters contain cen- 
tral peaks and peak rings, which form at 
the center of a newborn crater when, like
the splash of a stone in a pond, molten
rock rebounds just after a strike. To look 
for those features, the researchers needed 
a dedicated radar mission. 

Coincidentally, the Alfred Wegener In- 
stitute for Polar and Marine Research in 
Bremerhaven, Germany, had just pur- 
chased a next-generation ice-penetrating 
radar to mount across the wings and body 
of their Basler aircraft, a twin-propeller 
retrofitted DC-3 that’s a workhorse of Arc- 
tic science. But they also needed financing 
and a base close to Hiawatha. 

Kjær took care of the money. Traditional
funding agencies would be too slow, or 
prone to leaking their idea, he thought. 
So he petitioned Copenhagen’s Carls- 
berg Foundation, which uses profits from 
its global beer sales to finance science. 
MacGregor, for his part, enlisted NASA 
colleagues to persuade the U.S. military 
to let them work out of Thule Air Base, a 
Cold War outpost on northern Greenland, 
where German members of the team had 
been trying to get permission to work for 
20 years. “I had retired, very serious Ger- 
man scientists sending me happy-face 
emojis,” MacGregor says. 

Three flights, in May 2016, added 
1600 kilometers of fresh data from dozens 
of transits across the ice—and evidence 
that Kjær, MacGregor, and their team
were onto something. The radar revealed 
five prominent bumps in the crater’s cen- 
ter, indicating a central peak rising some 
50 meters high. And in a sign of a recent 
impact, the crater bottom is exceptionally 
jagged. If the asteroid had struck earlier 
than 100,000 years ago, when the area 
was ice free, erosion from melting ice far- 
ther inland would have scoured the crater 
smooth, MacGregor says. The radar signals 
also showed that the deep layers of ice 
were jumbled up—another sign of a re- 
cent impact. The oddly disturbed patterns, 
MacGregor says, suggest “the ice sheet 
hasn’t equilibrated with the presence of 
this impact crater.” 

But the team wanted direct evidence to 
overcome the skepticism they knew would 
greet a claim for a massive young crater, 
one that seemed to defy the odds of how of- 
ten large impacts happen. And that’s why 
Kjær found himself, on that bright July
day in 2016, frenetically sampling rocks 
all along the crescent of terrain encircling 
Hiawatha’s face. His most crucial stop 
was in the middle of the semicircle, near 
the river, where he collected sediments 
that appeared to have come from the gla- 
cier’s interior. It was hectic, he says—“one 
of those days when you just check your 
samples, fall on the bed, and don’t rise for 
some time.”

The hidden crater


Under a lobe of ice on northwest Greenland, airborne radar and ground sampling have uncovered a 
giant and remarkably fresh impact crater. Though not as large as the dinosaur-killing Chicxulub impact, 
Hiawatha crater may have formed as recently as the end of the last ice age, as humans were spreading 
across North America. Meltwater from the impact could have triggered a thousand-year chill in the 
Northern Hemisphere by disrupting currents in the Atlantic Ocean.

Samples near the glacier’s outlet
contained beads of once-molten glass
and shocked quartz: crystals scarred by
high temperatures and pressures.


In that outwash, Kjær’s team closed
its case. Sifting through the sand, Adam 
Garde, a geologist at the Geological Survey 
of Denmark and Greenland in Copenha- 
gen, found glass grains forged at tempera- 
tures higher than a volcanic eruption can 
generate. More important, he discovered 
shocked crystals of quartz. The crystals 
contained a distinctive banded pattern 
that can be formed only in the intense 
pressures of extraterrestrial impacts or nu- 
clear weapons. The quartz makes the case, 
Melosh says. “It looks pretty good. All the 
evidence is pretty compelling.” 

Now, the team needs to figure out ex- 
actly when the collision occurred and how 
it affected the planet.

Seeing through ice

A Basler BT-67 aircraft, fitted with radars
on its belly and wings, crisscrossed the
crater, looking for reflections.

Graphic labels: Ice-penetrating radar; Disturbed ice; Radar-detected peak; Rebound effect

THE YOUNGER DRYAS, named after a small 
white and yellow arctic flower that flour- 
ished during the cold snap, has long fas- 
cinated scientists. Until human-driven 
global warming set in, that period reigned 
as one of the sharpest recent swings in 
temperature on Earth. As the last ice age 
waned, about 12,800 years ago, tempera- 
tures in parts of the Northern Hemisphere 
plunged by as much as 8°C, all the way 
back to ice age readings. They stayed that 
way for more than 1000 years, turning ad- 
vancing forest back into tundra. 

After an impact, rebounding molten rock piles up in a central peak and sometimes collapses into a peak ring, one way to distinguish an impact crater from a volcano.

Map labels: Hiawatha crater; Thule Air Base; Camp Century; Cape York fragments; NGRIP; GISP2; Deep ice core site; Washington, D.C.

Where is the impact debris?
None of the drilled Greenland ice cores (red dots) contains meteoritic debris.

The impact would have tunneled through ice and bedrock, leaving a crater 31 kilometers wide and more than 300 meters deep.

The trigger could have been a disruption
in the conveyor belt of ocean currents, including
the Gulf Stream that carries heat
northward from the tropics. In a 1989 paper
in Nature, Kennett, along with Wallace
Broecker, a climate scientist at Columbia 
University’s Lamont-Doherty Earth Obser- 
vatory, and others, laid out how meltwater 
from retreating ice sheets could have shut 
down the conveyor. As warm water from 
the tropics travels north at the surface, it 
cools while evaporation makes it saltier. 
Both factors boost the water’s density un- 
til it sinks into the abyss, helping to drive 
the conveyor. Adding a pulse of less-dense 
freshwater could hit the brakes. Paleo- 
climate researchers have largely endorsed 
the idea, although evidence for such a 
flood has been lacking until recently. 
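
The density argument can be made concrete with a rough sketch. The Python below uses a linearized equation of state, a common textbook simplification rather than anything taken from the 1989 paper. The reference values, the coefficients `ALPHA` and `BETA`, the sample temperatures and salinities, and the function name `seawater_density` are all illustrative assumptions.

```python
# Linearized equation of state for seawater (illustrative assumption):
#   rho = RHO0 * (1 - ALPHA * (T - T0) + BETA * (S - S0))
RHO0 = 1027.0    # reference density in kg/m^3 (assumed)
T0 = 10.0        # reference temperature in deg C (assumed)
S0 = 35.0        # reference salinity in g/kg (assumed)
ALPHA = 1.7e-4   # thermal expansion coefficient per deg C (assumed typical value)
BETA = 7.6e-4    # haline contraction coefficient per g/kg (assumed typical value)


def seawater_density(temp_c: float, salinity_g_per_kg: float) -> float:
    """Approximate seawater density in kg/m^3 from temperature and salinity."""
    return RHO0 * (1.0 - ALPHA * (temp_c - T0) + BETA * (salinity_g_per_kg - S0))


if __name__ == "__main__":
    # Hypothetical water masses chosen only to show the direction of each effect.
    cases = {
        "warm tropical surface water": (25.0, 35.0),
        "cooled, saltier northern water": (3.0, 35.5),
        "northern water after a freshwater pulse": (3.0, 33.5),
    }
    for label, (temp_c, salinity) in cases.items():
        print(f"{label}: {seawater_density(temp_c, salinity):.2f} kg/m^3")
```

Under these assumed numbers, cooling and added salt make the northern water denser than the tropical water, while freshening the same cold water makes it lighter again. That is the sense in which a pulse of meltwater could slow the sinking that helps drive the conveyor.
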
Then, in 2007, Kennett suggested a new 
trigger. He teamed up with scientists led by
Richard Firestone, a physicist at Lawrence
Berkeley National Laboratory in Califor- 
nia, who proposed a comet strike at the key 
moment. Exploding over the ice sheet cov- 
ering North America, the comet or comets 
would have tossed light-blocking dust into 
the sky, cooling the region. Farther south, 
fiery projectiles would have set forests 
alight, producing soot that deepened the 
gloom and the cooling. The impact also 
could have destabilized ice and unleashed 
meltwater that would have disrupted the 
Atlantic circulation. 

The climate chaos, the team suggested, 
could explain why the Clovis settle- 
ments emptied and the megafauna van- 
ished soon afterward. But the 
evidence was scanty. Firestone and 
his colleagues flagged thin sedi- 
ment layers at dozens of archaeo- 
logical sites in North America. 
Those sediments seemed to con- 
tain geochemical traces of an extra- 
terrestrial impact, such as a peak 
in iridium, the exotic element that 
helped cement the case for a Chicx- 
ulub impact. The layers also yielded 
tiny beads of glass and iron— 
possible meteoritic debris—and 
heavy loads of soot and charcoal, 
indicating fires. 

The team met immediate criti- 
cism. The decline of mammoths, 
giant sloths, and other species had 
started well before the Younger 
Dryas. In addition, no sign existed of 
a human die-off in North America, 
archaeologists said. The nomadic 
Clovis people wouldn’t have stayed 
long in any site. The distinctive spear 
points that marked their presence 
probably vanished not because the people 
died out, but rather because those weapons 
were no longer useful once the mammoths 
waned, says Vance Holliday, an archaeo- 
logist at The University of Arizona in Tuc- 
son. The impact hypothesis was trying to 
solve problems that didn’t need solving. 

The geochemical evidence also began to 
erode. Outside scientists could not detect 
the iridium spike in the group’s samples. 
The beads were real, but they were abun- 
dant across many geological times, and 
soot and charcoal did not seem to spike at 
the time of the Younger Dryas. “They listed 
all these things that aren’t quite sufficient,” 
says Stein Jacobsen, a geochemist at Har- 
vard University who studies craters. 

Yet the impact hypothesis never quite 
died. Its proponents continued to study the 
putative debris layer at other sites in Europe 
and the Middle East. They also reported 
finding microscopic diamonds at different 
sites that, they say, could have been formed
only by an impact. (Outside researchers
question the claims of diamonds.) 

Now, with the discovery of Hiawatha 
crater, “I think we have the smoking gun,” 
says Wendy Wolbach, a geochemist at De- 
Paul University in Chicago, Illinois, who has 
done work on fires during the era. 

The impact would have melted 1500 giga- 
tons of ice, the team estimates—about as 
much ice as Antarctica has lost because 
of global warming in the past decade. The 
local greenhouse effect from the released 
steam and the residual heat in the crater 
rock would have added more melt. Much of 
that freshwater could have ended up in the 
nearby Labrador Sea, a primary site pumping
the Atlantic Ocean’s overturning circulation.
“That potentially could perturb the
circulation,” says Sophia Hines, a marine 
paleoclimatologist at Lamont-Doherty. 

Leery of the earlier controversy, Kjær
won't endorse that scenario. “I’m not put- 
ting myself in front of that bandwagon,” he 
says. But in drafts of the paper, he admits, 
the team explicitly called out a possible con- 
nection between the Hiawatha impact and 
the Younger Dryas. 


THE EVIDENCE STARTS with the ice. In the 
radar images, grit from distant volcanic 
eruptions makes some of the boundar- 
ies between seasonal layers stand out as 
bright reflections. Those bright layers can 
be matched to the same layers of grit in 
cataloged, dated ice cores from other parts 
of Greenland. Using that technique, Kjær’s
team found that most ice in Hiawatha is 
perfectly layered through the past 11,700 
years. But in the older, disturbed ice below,
the bright reflections disappear. Tracing
the deep layers, the team matched the
jumble with debris-rich surface ice on Hi- 
awatha’s edge that was previously dated 
to 12,800 years ago. “It was pretty self- 
consistent that the ice flow was heavily dis- 
turbed at or prior to the Younger Dryas,” 
MacGregor says. 

NASA and German aircraft used radar to see the contours of an impact crater beneath the ice of Hiawatha Glacier.

Other lines of evidence also suggest
Hiawatha could be the Younger Dryas
impact. In 2013, Jacobsen examined an
ice core from the center of Greenland,
1000 kilometers away. He was expecting
to put the Younger Dryas impact theory
to rest by showing that, 12,800 years ago,
levels of metals that asteroid impacts
tend to spread did not spike. Instead, he
found a peak in platinum, similar to ones 
measured in samples from the crater site. 
“That suggests a connection to the Younger 
Dryas right there,” Jacobsen says. 

For Broecker, the coincidences add 
up. He had first been intrigued by the 
Firestone paper, but quickly joined the ranks 
of naysayers. Advocates of the Younger Dryas 
impact pinned too much on it, he says: the 
fires, the extinction of the megafauna, the 
abandonment of the Clovis sites. “They put 
a bad shine on it.” But the platinum peak 
Jacobsen found, followed by the discovery of 
Hiawatha, has made him believe again. “It’s 
got to be the same thing,” he says. 

In 2016, Kurt Kjær looked for evidence of an impact in sand washed out from underneath Hiawatha Glacier. He would find glassy beads and shocked crystals of quartz.

Yet no one can be sure of the timing.
The disturbed layers could reflect nothing
more than normal stresses deep in the ice
sheet. “We know all too well that older ice
can be lost by shearing or melting at the
base,” says Jeff Severinghaus, a paleoclimatologist
at the Scripps Institution of Oceanography
in San Diego, California. Richard
Alley, a glaciologist at Pennsylvania State 
University in University Park, believes the 
impact is much older than 100,000 years 
and that a subglacial lake can explain the 
odd textures near the base of the ice. “The 
ice flow over growing and shrinking lakes 
interacting with rough topography might 
have produced fairly complex structures,” 
Alley says. 

A recent impact should also have left 
its mark in the half-dozen deep ice cores 
drilled at other sites on Greenland, which 
document the 100,000 years of the current 
ice sheet’s history. Yet none exhibits the thin 
layer of rubble that a Hiawatha-size strike 
should have kicked up. “You really ought to 
see something,” Severinghaus says. 

Brandon Johnson, a planetary scientist 
at Brown University, isn’t so sure. After 
seeing a draft of the study, Johnson, who 
models impacts on icy moons such as Eu- 
ropa and Enceladus, used his code to re- 
create an asteroid impact on a thick ice 
sheet. An impact digs a crater with a cen- 
tral peak like the one seen at Hiawatha, he 
found, but the ice suppresses the spread of 
rocky debris. “Initial results are that it goes
a lot less far,” Johnson says.


EVEN IF THE ASTEROID struck at the right 
moment, it might not have unleashed all 
the disasters envisioned by proponents of 
the Younger Dryas impact. “It’s too small 
and too far away to kill off the Pleistocene
mammals in the continental United
States,” Melosh says. And how a strike
could spark flames in such a cold, barren 
region is hard to see. “I can’t imagine how 
something like this impact in this location 
could have caused massive fires in North 
America,” Marlon says. 

Banded patterns in the mineral quartz are diagnostic of shock waves from an extraterrestrial impact.

It might not even have triggered the
Younger Dryas. Ocean sediment cores
show no trace of a surge of freshwater into
the Labrador Sea from Greenland, says
Lloyd Keigwin, a paleoclimatologist at the
Woods Hole Oceanographic Institution in
Massachusetts. The best recent evidence,
he adds, suggests a flood into the Arctic
Ocean through western Canada instead. 
An external trigger may be unnecessary 
in any case, Alley says. During the last ice 
age, the North Atlantic saw 25 other cooling 
spells, probably triggered by disruptions 
to the Atlantic’s overturning circulation. 
None of those spells, known as Dansgaard- 
Oeschger (D-O) events, was as severe as 
the Younger Dryas, but their frequency
suggests an internal cycle played a role
in the Younger Dryas, too. Even Broecker 
agrees that the impact was not the ulti- 
mate cause of the cooling. If D-O events 
represent abrupt transitions between two 
regular states of the ocean, he says, “you 
could say the ocean was approaching in- 
stability and somehow this event knocked 
it over.” 

Still, Hiawatha’s full story will come down 
to its age. Even an exposed impact crater can 
be a challenge for dating, which requires 
capturing the moment when the impact al- 
tered existing rocks—not the original age of 
the impactor or its target. Kjær’s team has
been trying. They fired lasers at the glassy 
spherules to release argon for dating, but 
the samples were too contaminated. The 
researchers are inspecting a blue crystal 
of the mineral apatite for lines left by the 
decay of uranium, but it’s a long shot. The 
team also found traces of carbon in other 
samples, which might someday yield a 
date, Kjær says. But the ultimate answer
may require drilling through the ice to the 
crater floor, to rock that melted in the im- 
pact, resetting its radioactive clock. With 
large enough samples, researchers should 
be able to pin down Hiawatha’s age. 

Given the remote location, a drilling 
expedition to the hole at the top of the 
world would be costly. But an understand- 
ing of recent climate history—and what a 
giant impact can do to the planet—is at 
stake. “Somebody’s got to go drill in there,” 
Keigwin says. “That’s all there is to it.”

Lighting up the nighttime
Artificial light at night needs to be reduced to limit 
negative environmental impacts 


By Kevin J. Gaston 


Among the most visually compelling
images of the whole Earth have been 
those created using data obtained 
at night by astronauts or from sat- 
ellites. The proliferation in use of 
electric lighting—including from 
industrial, commercial, municipal, and do- 
mestic sources—is striking. It sketches the 
spatial distribution of much of the human 
population, outlining a substantial propor- 
tion of the world’s coastline, highlighting a 
multitude of towns and cities, and drawing 
the major highways that connect them. The 
data embodied in these nighttime images 
have been used to estimate and map levels
of energy use, urbanization, and economic
activity. They have also been key in focusing
attention on the environmental impacts
of the artificial light at night itself. Explicit
steps need to be taken to limit these impacts,
which vary according to the intensity,
spectrum, spatial extent, and temporal dynamics
of this lighting.

744 16 NOVEMBER 2018 • VOL 362 ISSUE 6416


FROM DIRECT LIGHT TO SKYGLOW 

Artificial light at night can usefully be 
thought of as having two linked components. 
The first component, direct emissions from
outdoor lighting sources (which include
streetlights, building and infrastructure
lighting, and road vehicle headlamps), is spatially
extremely heterogeneous. Ground-level
illuminance in the immediate vicinity can
vary from less than 10 lux (lx) to more than
100 lx (for context, a full moon on a clear
night has an illuminance of up to 0.1 lx). It
often declines rapidly over distances of a few 
meters. However, emissions from unshielded 
lights can, when unobstructed, carry horizon- 
tally over many kilometers, making artificial 
light at night both an urban and a rural issue. 
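
To put the quoted illuminance figures side by side, the short Python sketch below expresses direct lighting as a multiple of full moonlight. It uses only the numbers given in the paragraph above; the constant names and the helper `times_brighter` are hypothetical, and the calculation is a simple ratio, not a model of how organisms perceive light.

```python
# Illuminance figures quoted in the text, in lux (lx).
FULL_MOON_LX = 0.1      # upper value for a full moon on a clear night
DIRECT_LOW_LX = 10.0    # lower figure quoted for direct lighting
DIRECT_HIGH_LX = 100.0  # upper figure quoted for direct lighting


def times_brighter(illuminance_lx: float, reference_lx: float = FULL_MOON_LX) -> float:
    """Return how many times brighter an illuminance is than a reference level."""
    if reference_lx <= 0:
        raise ValueError("reference illuminance must be positive")
    return illuminance_lx / reference_lx


if __name__ == "__main__":
    for label, lux in [("lower figure", DIRECT_LOW_LX), ("upper figure", DIRECT_HIGH_LX)]:
        ratio = times_brighter(lux)
        print(f"{label}: {lux:g} lx is roughly {ratio:,.0f} times full moonlight")
```

On these figures, ground-level illuminance near a light source spans roughly two to three orders of magnitude above the brightest natural nighttime light.
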

The second component of artificial light at 
night is skyglow, the brightening of the night- 
time sky caused mainly by upwardly emitted 
and reflected artificial light that is scattered 
in the atmosphere by water, dust, and gas 
molecules. Although absolute illuminance 
levels are at most about 0.2 to 0.5 lx, much
lower than those from direct emissions, these 
are often sufficiently high to obscure the 
Milky Way, which is used for orientation by 
some organisms. In many urban areas, sky- 
glow even obscures lunar light cycles, which 
are used by many organisms as cues for bio- 
logical activity. 


Environment and Sustainability Institute, University of Exeter, 
Penryn, Cornwall TR10 9FE, UK. Email: k.j.gaston@exeter.ac.uk 




The bright lights of Tokyo and other cities in Japan, photographed from the International Space Station in 2015, show the proliferation of artificial light on our planet.

PHOTO: NASA, SCOTT KELLY


In the laboratory, organismal responses, 
such as suppression of melatonin levels and 
changes to behavioral activity patterns, gener- 
ally increase with greater intensities of artifi- 
cial light at night. It is challenging to establish 
the form of such functional relationships in 
the field, but experiments and observations 
have shown that commonplace levels of artifi- 
cial light at night influence a wide range of bi- 
ological phenomena across a wide diversity of 
taxa, including individual physiology and be- 
havior, species abundances and distributions, 
community structure and dynamics, and ecosystem
function and process (1). Exposure to
even dim nighttime lighting (below 1 lx) can
drastically change activity patterns of both 
naturally day-active and night-active species. 
These effects can be exacerbated by trophic 
interactions, such that the abundances of spe- 
cies whose activity is not directly altered may 
nonetheless be severely affected under low 
levels of nighttime lighting (2). 


SHIFTING SPECTRA 

Globally, the prevailing technology of out- 
door lighting is undergoing a marked shift 
to light-emitting diodes (LEDs). Although 
LEDs can be used to produce a wide di- 
versity of emission spectra, the main trend 
has been for narrower-spectrum street 
lighting with lower (“warmer”) correlated 
color temperature (CCT) to be replaced by 
broader-spectrum LED lamps with higher 
(“cool white”) CCT. This lighting provides 
improved color rendering, more faithfully 
revealing colors as seen under sunlight, 
but also tends to exacerbate skyglow unless 
accompanied by dimming and improved 
shielding (3). Biological responses to light 
are almost invariably spectrum-dependent, 
and broadening the spectrum of emissions 
increases the likelihood of their overlapping 
with these patterns of sensitivity, often in- 
creasing the biological impact. 

Of particular concern is the growth in 
emissions of blue wavelengths, to which 
melatonin suppression is disproportionately 
sensitive. Multiple cascading processes can 
include stress responses, disease risk, and 
likelihood of obesity. Medical organizations 
have advised that poorly designed high- 
intensity and high-CCT street lighting should 
be avoided to minimize potential harm to 
human sleep patterns, sleep quality, and cir- 
cadian rhythms (4). Studies have also raised 
concerns that greater exposure to artificial 
light at night increases some cancer risks (5). 
However, it is difficult to isolate effects of outdoor
lighting from those of indoor lighting
(including the trespass of outdoor lighting
indoors) and to adequately control for other 
risk factors to human health. Concerted ef- 
fort needs to be invested in assessing the 
existing evidence for such impacts both in 
principle and practice, and to find improved 
methods for measuring these impacts. 


SPATIAL PATTERNS 
The global importance of the impacts of ar- 
tificial light at night rests in large part on its 
spatial extent. The direct footprint of light 
emissions is hard to estimate, being heavily 
dependent on the spatial resolution of avail- 
able data. According to the best estimates, 
this extent is increasing at about 2% per year, 
with growth in the intensity of lighting from 
already lit areas occurring at a similar rate 
(6). The reduced operational costs of using 
LED lamps seem to have encouraged the in- 
stallation of yet more lighting, rather than 
savings on preexisting lighting needs. 
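
As a rough illustration of what growth of about 2% per year implies, the Python sketch below computes the doubling time under steady compound growth. The assumption that the rate stays constant is ours, not the article's, and the function names `doubling_time_years` and `growth_factor` are hypothetical.

```python
import math


def doubling_time_years(annual_growth_rate: float) -> float:
    """Years needed for a quantity to double at a constant compound growth rate."""
    if annual_growth_rate <= 0:
        raise ValueError("growth rate must be positive")
    return math.log(2) / math.log(1 + annual_growth_rate)


def growth_factor(annual_growth_rate: float, years: float) -> float:
    """Multiplicative increase after the given number of years of compound growth."""
    return (1 + annual_growth_rate) ** years


if __name__ == "__main__":
    rate = 0.02  # "about 2% per year", taken from the text
    print(f"Doubling time: {doubling_time_years(rate):.1f} years")
    print(f"Increase over 10 years: {growth_factor(rate, 10):.2f}x")
```

If the rate held steady, the lit area would double in about 35 years. Because the intensity of already lit areas is reported to grow at a similar rate, the combined change would be faster still.
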
Conservatively, the overall coverage by 
skyglow is now nearly one-fourth of global 
land area, with 83% of the human popu- 
lation estimated to be living under light- 
polluted skies (7). Skyglow can extend 
hundreds of kilometers from urban sources, 
changing the nighttime environment in 
places that may be protected from many 
other anthropogenic pressures. Yet those 
persons responsible for these distant effects 
rarely recognize their role in creating them, 
nor are they held accountable. 
Understanding of landscape-scale impacts 
of artificial light at night is in its infancy. 
Nonetheless, there is evidence that the attrac- 
tion of nighttime-migrating birds to artificial 
light sources has strong negative impacts on 
their routes, their stopover habitat selection, 
and likely their energetics (8, 9). Artificial 
light at night is also suspected to have played 
a role in catastrophic region-wide declines 
in insect populations. Although categori- 
cal evidence remains wanting, those species 
of moths that would seem most vulnerable, 
such as those attracted to lights or that are 
night-active, have been shown to have experienced
the greatest losses (10).


CHANGING NIGHTTIME LIGHT PATTERNS 
Artificial lighting truncates the duration 
of darkness attained in lit areas. Stationary 
and mobile (e.g., vehicle headlamp) sources 
tend to be switched on around the onset of 
dusk, decline to some degree as nighttime 
progresses (with some stationary lights be- 
ing switched off and traffic levels declining), 
and continue until around the conclusion of 
dawn. As a consequence, the sky over urban 
areas often becomes somewhat darker as 
night progresses, whereas in nearby rural lo- 
cations the sky becomes brighter as the Moon 
rises and darker as it sets (11).

Given the vital role of natural light cycles
as cues for daily and seasonal timings of 
biological activities, it is unsurprising that 
these changes wrought by artificial light 
at night can alter those timings (12). This
applies not only to animals. For example, 
artificial light at night has been found to ad- 
vance the timing of budburst in temperate 
trees by several days (13). The magnitude of 
such changes can be similar to those caused 
by climate change, raising questions as to 
how the effects of climate change and artifi- 
cial nighttime lighting interact. 


REDUCING THE IMPACTS OF 

ARTIFICIAL LIGHT 

Given the apparent pervasiveness of the 
negative biological impacts of artificial light 
at night, it is vital that these be reduced. Lag 
effects are common to many anthropogenic 
pressures on the environment. For example, 
even if levels of CO₂ emissions were to be
dramatically reduced, Earth would continue 
to warm. However, such lags seem much less 
likely for the effects of artificial light at night. 
Reductions in artificial light at night would 
not result in instant recovery, but such recov- 
ery could be relatively swift. 

Limiting the use of artificial light at night 
to the places, times, and forms required to 
ensure that people can use the nighttime 
appropriately would enable drastic reduc- 
tions in artificial light at night in much of the 
world. Artificial light at night brings tangible 
benefits to people, most notably in extend- 
ing the time available for work and social 
activities. However, existing regulations and 
requirements of lighting focusing on issues of 
safety and security are seldom supported by 
robust empirical evidence (14).

To reduce the negative effects of artificial 
light at night requires a blend of common 
sense and exploitation of technology. First, 
artificial light at night should not casually 
be introduced into areas in which it has 
not previously occurred, especially in those 
regions in which naturally dark spaces are 
now scarce (e.g., Western Europe, the eastern 
United States, East Asia). 

Second, lighting should be at the lowest 
realistic intensity. The environmental im- 
pacts and energy costs of artificial light at 
night could be much reduced if emissions 
were cut to the low levels already used in 
some cities (e.g., Berlin). 

Third, outdoor sources should be designed 
to ensure that lighting is limited to the places 
where it is actually required; many of the 
problems caused by artificial light at night 
result from poor lighting design, especially 
inadequate shielding. To date, attention here 
has fallen foremost on improved shielding of 
streetlights, which are often funded from the 
public purse. However, attention also needs
to be paid to the impacts of other major local
contributors to artificial light at night, including
gas stations, parking lots, transport
hubs, sports stadiums, and billboards. 

Fourth, lighting should only be used at 
times when it is required; most outdoor light- 
ing should routinely be dimmed or switched 
off during periods of low demand. Although
the costs of smart technologies remain
limiting despite the energy savings, these
technologies can assist in assessing needs
and modifying lighting levels accordingly.

Finally, a more nuanced approach needs 
to be taken with regard to the spectrum of 
lighting, with preferential use of lower-CCT 
(<2400 K) and thus environmentally less 
disruptive lamps. Near environmentally sen- 
sitive zones (e.g., around protected areas, 
around otherwise dark habitat corridors), 
only narrow-spectrum lighting should be 
used, if lighting is needed at all. Use of dim- 
mer and warm white or amber lighting along 
rural roads and in residential areas would 
help to reduce impacts, even if brighter and 
higher-CCT lighting is used in areas with 
heavier nighttime use by people and at busy 
road junctions. The present trend toward 
widespread introduction of high-CCT light- 
ing (4000 K) brings major negative envi- 
ronmental impacts, relative to lower-CCT 
lighting, for no obvious benefit. 

The ongoing and planned modernization 
of public lighting systems in many regions of 
the world provides a vital and ready oppor- 
tunity to reduce the environmental impacts 
of artificial light at night. It could also facili- 
tate the establishment of new norms that can 
help to shape future developments (includ- 
ing in regions that currently have no or little 
urbanization) that otherwise have enormous 
potential to further erode the nighttime, such 
as the inevitable growth in off-grid lighting 
schemes. Future nighttime images of Earth 
need not be records of further progression toward
loss of the nighttime and its benefits.

Endothelial cell transitions


Are endothelial-to-mesenchymal transitions in various 
vascular pathologies a consequence, cause, or defense? 


By Elisabetta Dejana and
Maria Grazia Lampugnani


Endothelial cells cover the internal surface
of all types of vessels in the body
and play a highly specialized role in
protecting the vessel wall and the underlying
tissues from noxious stimuli.
These cells show organ-directed specialization
and adapt to the requirements of
different organs. However, in pathological 
conditions—such as inflammation, fibrosis, 
and atherosclerosis—endothelial cells can 
change their morphological and functional 
characteristics and acquire properties of 
other cell lineages such as fibroblasts, myofi- 
broblasts, smooth muscle cells, and pericytes 
(which wrap around vessels) in a process 
called endothelial-to-mesenchymal transition 
(EndMT). This change of endothelial pheno- 
type may increase the vascular responses to 
thrombosis (blood clot), decrease permeabil- 
ity control, and increase fibrotic reactions. 
In addition, when endothelial cells undergo 
EndMT, they release abnormal amounts and 
types of growth factors and extracellular 
matrix proteins that constitute important 
mediators in a dysfunctional cross-talk with 
the surrounding cells. Given the relevance of 
EndMT in several pathologies, understand- 
ing the molecular basis of EndMT could be 
instrumental for the development of new 
therapeutic interventions. 

The switch to a mesenchymal phenotype 
is not a simple binary event. Instead, it oc- 
curs as a continuum with temporal changes 
in endothelial and mesenchymal marker 
expression. Cells that express at the same 
time both endothelial and mesenchymal 
markers are frequently observed in vivo in 
samples from both mice and humans, and 
detection of such coexpression has been 
crucial for the documentation of EndMT 
(1). It is therefore tempting to hypothesize 
that in a manner similar to the better char- 
acterized epithelial-to-mesenchymal transi- 
tion (EMT), endothelial cells undergo a set 
of multiple and dynamic transitional states 
characterized by the fine-tuning of different 
transcription factors, sequential expression
of mesenchymal markers, and modification
of cellular functions (2–5). EndMT can be
reverted pharmacologically in cultured cells 
(2), although the persistence of such rever- 
sion upon drug withdrawal is not known. 
Currently, we do not know whether EndMT 
is a reversible phenomenon in vivo (to
what extent it is reversible and in response
to which stimuli) or whether some cells 
can reach and maintain an intermediate 
phenotype without progression to a clearly 
defined mesenchymal cell phenotype. These 
considerations are important because the 
progressive acquisition of mesenchymal 
features corresponds to different functional 
responses of the cell. These questions re- 
quire further experimental investigation. 

The process of EndMT has been docu- 
mented to occur in many pathological model 
systems and also in humans, although the 
actual contribution of this phenomenon to 
pathological dysfunctions remains a matter 
of debate (6–9). Skepticism about the biological
relevance of EndMT arises mainly because
of technical limitations, including that
individual mesenchymal cells derived from 
samples with evidence of EndMT are not al- 
ways easily distinguishable from fibroblasts 
of other origins, such as mesenchymal stem 
cells or stromal cells that are normally within 
tissues; it is also difficult to track EndMT in 
vivo. We envisage that EndMT is the result 
of a multistep phenomenon that follows spe- 
cific kinetics. Consistently, endothelial cells 
express different mesenchymal markers at 
different time points after EndMT-triggering 
events (2). Moreover, endothelial cells can- 
not be synchronized in vivo, and this leads to 
variability in the type of markers expressed. 
Additionally, the costaining of cells with specific
endothelial and mesenchymal markers
can be difficult to interpret if the antibodies
are not strictly specific or the signals are not
sufficiently bright. Cell-tracking experiments in
mice are valuable, although the promoters
that drive expression of genes encoding
endothelial cell markers can be leaky
and can also be active at low levels in
nonendothelial cell types, thus making the
interpretation of data difficult.

These technical problems, however, can be 
solved to a large extent through the combina- 
tion of different approaches, such as in vivo 
cell-lineage tracing in mice and in vitro cell 
characterization for the expression
of multiple markers. Most
importantly, cell tracking studies
will become possible in vivo
in humans by use of highly advanced
imaging techniques (10).
These approaches could be used 
to track endothelial cell dynam- 
ics and phenotype in vivo in vari- 
ous disease settings. 

In the embryo, EndMT is a
physiological process that is 
necessary for correct heart de- 
velopment (2, 10). The cells of 
the endocardium undergo the 
EndMT switch to be able to eas- 
ily migrate into the cardiac jelly 
and to participate in valve for- 
mation. This process is strictly 
controlled in time and space dur- 
ing development. In the adult, 
EndMT has been associated
mainly with pathological condi- 
tions characterized by vascular 
degeneration and fibrosis. For 
example, in atherosclerosis, endothelial cells 
lose their vascular cell fate and acquire peri- 
vascular smooth muscle and mesenchymal 
characteristics and thus participate in the 
thickening of the muscular layer of arteries 
(media) that is typical of this condition (2, 7, 
11). EndMT is also implicated in other patho- 
logical reactions, including vascular calcifica- 
tion; pulmonary arterial hypertension; vein 
graft rejections; heart, kidney, and lung fibro- 
sis; cancer-associated fibrosis; fibrodysplasia 
ossificans progressiva (a condition in which
muscle and connective tissue is replaced 
with bone); and brain vascular cavernomas 
(clusters of abnormal blood vessels) (2, 4, 
7, 8, 11-13). In these conditions, endothelial 
cells react to a pathological environment by 
acquiring mesenchymal characteristics and 
thereby potentially contribute to the evolu- 
tion of the disease. It is striking how the same 
environmental stimuli (for example, transforming
growth factor-β [TGF-β], Notch
ligands, and Wnt ligands) that finely orchestrate
EndMT in embryonic heart development
might induce pathological reactions of
endothelial cells in the adult (see the figure). 

An interesting hypothesis is that EndMT is 
the result of unresolved vascular remodeling 
(3). If endothelial cells are exposed for pro- 
longed times to disturbed shear stress (as in 
atherosclerosis) or to other noxious stimuli 
in the blood and lymph or in the organ en- 
vironment, they can undergo EndMT. This in 
turn will act as a feedback mechanism that 
maintains the vasculature in an unstable 
condition through induction of matrix me- 
talloproteases (MMPs) and liberation of ex- 
tracellular matrix-bound and extracellular 
matrix-contained ligands, formation of provisional
extracellular matrix, and expression
of leukocyte-adhesion molecules and inflammatory
cell recruitment, all of which will contribute
to chronic pathological conditioning
of the vasculature (2). Thus, the same stimuli
necessary for the correct development of the
embryo vasculature might induce pathological
reactions in the adult if they are expressed
for prolonged amounts of time and
at high levels. It is also possible that EndMT
represents an inadequate attempt to restore
functionality after vascular damage. Further
research is needed to solve these issues.

Stimuli, phenotype, and cell fate in EndMT

Endothelial cells may undergo EndMT in response to various stimuli. EndMT can
occur gradually with multiple and dynamic intermediate stages (characterized by
various phenotypes) and can result in the acquisition of distinct cell fates.
Stimuli: shear stress, TGF-β, Notch, Wnt–β-catenin, γ-interferon, interleukin-1,
tumor necrosis factor.
Phenotype: mesenchymal markers, stem cell markers, loss of contact inhibition,
altered junctions.

EndMT has a number of similarities with 
EMT, which is observed during tissue embryogenesis
and under pathological conditions,
such as invading carcinomas (5, 10,
14, 15). EMT and EndMT are triggered by
activation of transcription factors (such as
SNAIL1, SNAIL2, zinc finger E-box binding
homeobox 1 [ZEB1], Krüppel-like factor 2
[KLF2], and KLF4) and up-regulated
expression of a set of mesenchymal and 
stem cell markers, which includes CD44, 
fibroblast-specific protein 1 (FSP1), stem 
cell antigen 1 (SCA1), vimentin, MMP2, and 
MMP9. The coexpression of stem and mes- 
enchymal markers in EndMT and EMT sug- 
gests that in the course of transition to a 
mesenchymal phenotype, endothelial cells 
(like epithelial cells) can also acquire
stem cell-like properties. This introduces 
the concept that in specific contexts, mesen- 
chymal status may favor a condition of plu- 
ripotency. However, although extensively 
studied in cancer, the precise relationship 
between the EMT program and the stem 
cell state is not yet clarified and remains to 
be demonstrated in endothelial cells (4, 15). 
Agents that prevent or revert
EndMT could be beneficial to
treat several vascular diseases
that accompany EndMT (2, 11).
Although the molecular mechanisms
that drive EndMT are starting
to be unraveled, inhibitors for
selective EndMT targeting have
yet to be identified (2, 17). Currently,
inhibitors of EndMT-driving
stimuli (such as TGF-β,
bone morphogenetic proteins,
inflammatory cytokines or their
receptors, and Wnt–β-catenin
signaling) can prevent EndMT
in vivo and revert several
EndMT traits in cultured cells (2, 
4, 8). However, although preven- 
tion of EndMT can be achieved, 
the full restoration of endothelial 
cell functional identity might be 
more difficult once this is lost 
through EndMT. 

Endothelial cells have a high 
degree of diversity and special- 
ization that are dictated by the surrounding 
cells, nutrients, and blood and lymph flow. 
Thus, the endothelial cells that undergo 
EndMT might not follow identical pathways 
or respond to identical inhibitors or activa- 
tors. The restoration and maintenance of their 
original functional state might therefore be a 
particularly difficult task. Several stimulating 
questions on the process of EndMT still wait 
to be answered. Increasing our knowledge of 
EndMT should help to design more specific 
inhibitors to interfere with the pathological 
occurrence of such endothelial cell fate tran- 
sitions in different conditions such as athero- 
sclerosis, fibrosis in cancer, kidney disease, 
heart disease, and others. 
IMMUNOLOGY


A target to suppress inflammation 


A small-molecule inhibitor of the OGG1 DNA glycosylase has anti-inflammatory effects


By Leona D. Samson 


Tissues that are stressed by injury or
by invading pathogens elicit signals
for the recruitment of inflammatory
immune cells such as macrophages
and neutrophils, which consequently
release reactive oxygen and nitrogen
species (RONS) as part of the innate immune 
response (1). This flood of RONS is important 
for directly attacking invading patho- 
gens and for warding off infection of 
damaged tissue. But the efficiency of 
RONS in inactivating invading cells 
and viruses creates an Achilles heel of 
the innate immune response, namely 
that RONS are also able to kill and 
mutate cells in healthy tissues. Al- 
though inflammatory responses are 
usually counterbalanced over time 
by an opposing anti-inflammatory re- 
sponse, achieving the optimal balance 
with minimal collateral tissue damage 
is difficult even in healthy individuals, 
and virtually impossible in individuals 
with certain disease conditions such 
as ulcerative colitis and rheumatoid 
arthritis (2). Therefore, the develop- 
ment of anti-inflammatory drugs, 
targeting inflammation from a variety 
of different angles, has flourished in 
recent decades. On page 834 of this 
issue, Visnes et al. (3) present an en- 
tirely new approach to suppressing 
the inflammatory response. Counter- 
intuitively, this approach involves the 
inhibition of the 8-oxoguanine DNA 
glycosylase 1 (OGG1) DNA repair en- 
zyme that recognizes and initiates the 
base excision repair of 7,8-dihydro- 
8-oxoguanine (8-oxoG), one of the ma- 
jor types of DNA base damage induced 
by RONS. 

The recognition and binding of 
8-oxoG by OGG1, which frequently
occurs in the G-rich promoters of
proinflammatory genes, facilitates
the loading of the nuclear factor κB
(NF-κB) transcription factor that
stimulates transcription of proinflammatory
chemokine and cytokine
genes, thus promoting an inflammatory
response (4). Visnes et al. identified a
small-molecule inhibitor that neatly binds 
the active site of OGG1 and prevents its
search for 8-oxoG in DNA. Therefore, inhibited
OGG1 cannot bind to the G-rich
regions adjacent to NF-κB binding sites in
promoters of proinflammatory genes. This 
inhibitor (called TH5487) robustly inhibits 
the tumor necrosis factor (TNF)-induced 
inflammatory response in cultured mouse
and human lung epithelial cells and, most
notably, inhibits TNF-induced neutrophilic
inflammation in an in vivo mouse model.
Other small-molecule OGG1 inhibitors
have also recently been reported, demonstrating
the potential importance of this
target (5, 6).

Inhibition of inflammation
Inflammation is associated with the generation of RONS, which
damage G-rich promoters, including those adjacent to NF-κB
binding sites. OGG1 binding to oxidized guanines facilitates the
recruitment of transcription factors that drive proinflammatory gene
expression and a cellular inflammatory response. TH5487
prevents OGG1 from interacting with DNA substrates in promoters,
inhibiting inflammatory gene expression.

Central to this study is the hitherto underappreciated
role that OGG1 plays in regulating
a vigorous transcriptional response to
oxidative DNA damage (4) (see the figure).
Hints at the role of OGG1 in innate
immunity were gleaned from the
phenotype of mice in which the Ogg1
gene is inactivated (7). We now know
that evolution has exploited the specificity
of OGG1 for finding and binding
to 8-oxoG lesions in the genome and
deployed it as a signal for the coordinated
up-regulation of NF-κB-regulated
proinflammatory genes; OGG1
thus appears to be a major player in 
launching inflammatory responses. 

That the presence of DNA damage 
should signal the activation of cellular 
responses seems like a tale as old as 
time. Indeed, it is more than 60 years 
since Weigle and Bertani showed 
that bacteriophages exposed to ion- 
izing radiation (which induces DNA 
damage) survive better in irradiated 
versus unirradiated bacterial hosts 
(8). This improved survival occurred 
because preirradiated bacteria launch 
the SOS DNA damage response, which 
enables survival of damaged phage 
DNA. The DNA damage response 
proteins RecA in Escherichia coli and 
the mammalian ataxia-telangiectasia 
mutated (ATM) and ataxia telangiectasia
and Rad3-related (ATR) recognize
single-stranded DNA and broken
DNA ends, abnormal DNA structures 
that are generated by attempted DNA 
replication and transcription of dam- 
aged DNA, and intermediates of DNA 
repair. These proteins may be thought 
of as detecting common kinds of 
damage that arise as a result of a 
myriad of initial lesions that affect 
many sites on each of the four DNA 
bases, the deoxyribose sugar, and the 
sugar-phosphate backbone of DNA. 
However, OGG1 stimulates a cellular
response to a specific DNA base lesion
(8-oxoG) that has not yet been processed
by the DNA replication, transcription,
and repair machineries, and the transcriptional
response is stimulated directly by the
interaction of OGG1 with NF-κB, as opposed
to the signal transduction cascade of events
that follow recognition of damaged DNA by
RecA, ATM, and ATR (9). Perhaps the closest
analog to OGG1 acting as both a DNA repair
enzyme and a transcription regulator is
the E. coli Ada DNA repair methyltransferase
that, having transferred a methyl group
from the DNA sugar-phosphate backbone to
a cysteine in one of its active sites, acts as
a transcriptional regulator for a number of
other genes that promote resistance to methylating
agents (10, 11).

In 2004, inflammation was dubbed the 
“secret killer” (12), owing to established 
links between inflammation and heart dis- 
ease, cancer, Alzheimer’s disease, and many 
other diseases. By exploiting this newly rec- 
ognized role of OGG1 and identifying the 
potent TH5487 inhibitor, Visnes et al. have 
developed a new approach to combating 
the ravages of uncontrolled inflammation. 
Adding an OGG1 inhibitor to the panoply of
steroidal and nonsteroidal drugs, plus the 
more recently developed biologics that target 
TNF and other potent mediators of inflam- 
mation, can only help the cause. The obvious 
downside to systemically inhibiting the DNA 
repair function of OGG1 is that potentially 
mutagenic DNA lesions may accumulate 
in the genome, which might accelerate the 
very diseases inflammation has been linked 
to, particularly cancer. Other enzymes that
repair 8-oxoG may compensate for inhibited
OGG1 and thereby alleviate this potential
for collateral damage. It is possible that
transiently inhibiting OGG1 in conditions of
severe inflammation, such as sepsis or severe 
flare-ups in rheumatoid arthritis and other 
autoimmune diseases, may prove beneficial, 
whereas chronic administration of an OGG1 
inhibitor to prevent inflammation may not 
pass the health risk-benefit analysis. Visnes 
et al. have revealed fascinating new depths to 
plumb in the long search for alternative anti- 
inflammatory drugs to alleviate pain and re- 
duce the effects of inflammatory diseases. 


REFERENCES 


1. P. Lonkar, P. C. Dedon, Int. J. Cancer 128, 1999 (2011).
2. P. Li, Y. Zheng, X. Chen, Front. Pharmacol. 8, 460 (2017).
3. T. Visnes et al., Science 362, 834 (2018).
4. L. Pan et al., J. Biol. Chem. 291, 25553 (2016).
5. Donley et al., ACS Chem. Biol. 10, 2334 (2015).
6. Y. Tahara et al., J. Am. Chem. Soc. 140, 2105 (2018).
7. E. Touati et al., Helicobacter 11, 494 (2006).
8. J. J. Weigle, G. Bertani, Virology 2, 344 (1956).
9. A. Ciccia, S. J. Elledge, Mol. Cell 40, 179 (2010).
10. L. Samson, J. Cairns, Nature 267, 281 (1977).
11. T. Lindahl, B. Sedgwick, M. Sekiguchi, Y. Nakabeppu, Annu. Rev. Biochem. 57, 133 (1988).
12. C. Gorman et al., "Health: The fires within," Time, 23 February 2004; http://content.time.com/time/magazine/article/0,9171,993419,00.html.

10.1126/science.aav4744 


SCIENCE sciencemag.org 


CELL BIOLOGY 


Cell types behaving in their natural habitat


Defining cell types in situ connects gene expression, 
anatomy, and function during certain behaviors 


By Bosiljka Tasic and Philip R. Nicovich 


A full understanding of a complex system
is difficult, perhaps impossible,
to accomplish without an inventory
of the components involved. When
that system is an organ in an organism,
the parts list becomes a census of
cell types, including their identity, number,
location, and function. On page 792 of this
issue, Moffitt et al. (1) demonstrate an advanced
method for in situ profiling of gene
expression within the preoptic region of the 
intact mouse hypothalamus. Their approach 
not only yields a census of cell types in this 
region of the brain, but also assesses which 
cell types are activated during certain behav- 
iors. This region of the brain is involved in 
the regulation of homeostasis and social be- 
haviors such as aggression, 
sex, and parenting. 

Recently, comprehensive (genome-wide) analysis of gene expression in single
cells for the purposes of cell classification has grown rapidly, but it is most
frequently performed on dissociated cells (2). The current technique of choice
is single-cell RNA sequencing (scRNA-seq), which can detect thousands of
different messenger RNA (mRNA) species per cell in a relatively unbiased
fashion. In this process, thousands of mRNA species are isolated from individual
cells and then quantified. Cell classes and types are defined on the basis of
similar gene expression, but the spatial context is lost. By contrast, profiling
cells in intact tissue involves techniques that, to be informative, detect only
a subset of genes, selected by researchers (3). The main reason for relying on
gene selection is that many RNAs in cells are "packed" in a very small volume;
for example, the volume of a mouse neuron is ~1 to 5 picoliters (4), and it
contains ~200,000 RNA molecules (5). It is hard to detect all mRNA molecules in
such a tiny volume. In addition, expression levels of different mRNA species
vary by several orders of magnitude, meaning that the most abundant mRNAs will
be preferentially detected (6). However, many less-abundant transcripts define
cell identity and confer specific properties to a cell.

The common principle of techniques to spatially map gene expression is based
on RNA fluorescence in situ hybridization (FISH), whereby single-stranded DNA
or DNA nanostructures that complementarily hybridize with target mRNAs are
conjugated to fluorophores and are detected in tissue samples using
fluorescence microscopy. These assays are not "omic" (genome-wide and
unbiased), but with thoughtful gene selection they can distinguish all cell
types within a tissue.

Moffitt et al. use the now relatively standard scRNA-seq combined with their
latest version of multiplexed error-robust FISH (MERFISH) (7, 8) to define cell
types in the preoptic area of the mouse brain. First, they perform scRNA-seq to
find mRNAs that would be most informative for distinguishing cell types. Then,
they design MERFISH experiments to detect these select mRNAs and define cell
types in situ. MERFISH can detect hundreds of genes rapidly, owing to efficient
and robust tissue and probe chemistry (7, 8). Because of its high sensitivity
(six to eight times higher than scRNA-seq), MERFISH can focus on low-abundance
mRNAs and detect many at the same time in the crowded space of a single cell.

Allen Institute for Brain Science, 615 Westlake Avenue North,
Seattle, WA, USA. Email: bosiljkat@alleninstitute.org

Published by AAAS


Moffitt et al. find that the mouse preoptic 
area has ~70 neuronal cell types, and that 
MERFISH provides better cell-type resolu- 
tion than scRNA-seq. This is the first time 
this brain area has been characterized at this 
level of detail; therefore, many of the cell 
types have not previously been defined. The 
MERFISH results also provided high-reso- 
lution information on cell-type distribution 
and some generalizations: In the preoptic 
area, many excitatory cell types are spatially 
clustered, whereas the inhibitory ones are 
frequently dispersed throughout the area. 
Some newly defined cell types differ in their
abundance and distribution between males 
and females (referred to as sexual dimor- 
phism in brain structure) (9). 

The authors were interested in specific 
functions of these newly discovered cell 
types in homeostasis and social behaviors. 
To address these questions, they repeated 
the same cell-type detection experiments 
by MERFISH, but in animals that had un- 
dergone specific treatments or engaged in 
specific behaviors. By detecting gene ex- 
pression that indicates neuronal activity, 
they found cell types that were activated 
in these specific contexts. They found clear 
differences in cell types that were active in 
different behaviors (for example, aggression 
versus parenting) and different organismal 
states (virginity versus postsexual experi- 
ence), suggesting specific functions. The 
study provides avenues for addressing many 
intriguing biological questions. The most 
pressing are: Which cell types are respon- 
sible for which behaviors, and how do their 
functions change upon experience? For ex- 
ample, which cell types and what types of 
changes cause a switch from aggression 
to parenting in males after sexual experi- 
ence? To address these questions, carefully 
designed cell type-specific perturbation ex- 
periments are necessary (see the figure). 

In a broader context, the study of Moffitt 
et al. is a shining example for defining cell 
types in their “natural habitat” that should 
be emulated to create the next generation 
of brain atlases (10, 11). To transform the 
quagmire of unrecognizable neurons, we 
first need to define genes that are expressed
in them by scRNA-seq. Once the cells are
defined by genes they express (like linking a 
face to a person), we can define their spatial 
patterns and frequencies by MERFISH or 
similar methods. Then, we can repeatedly 
go back to these cells in different animals 
and assign them other cellular properties, 
such as morphology or connectivity, or pu- 
tative biological function by assaying their 
activity. Finally, to establish their function, 
we need to be able to perturb these cell 
types. Although not utilized in the current 
study, the identifying genes can be used 
to generate genetic tools to perturb cor- 
responding cell types in specific behaviors 
and states. These types of experiments de- 
fine cells that are necessary and/or suffi- 
cient to produce certain behaviors. In the 
future, step by step, marching through the 
brain, the approach outlined here will lead 
to unprecedented descriptions of nervous 
systems and an abundance of salient hy- 
potheses to be tested toward understanding 
nervous system function. 
Defining behaviorally relevant cell types in the brain

Once cell types are defined by sets of genes identified by scRNA-seq, their activity in different contexts and 
during various behaviors can be characterized in situ by MERFISH. To define whether the cells are necessary 
or sufficient for specific behavior, additional experiments, such as genetic perturbations, are necessary. 
Polarimetry enabled by nanophotonics


Nanoantenna and plasmonic 
structures can be used to 
measure light polarization 


By Alejandro Martinez 


Light beams consist of oscillatory
electric (and magnetic) fields having
a certain amplitude, phase, and frequency.
In transverse waves, the state
of polarization (SoP) characterizes
how the electric field oscillates in the
plane perpendicular to the propagation di- 
rection. Light-matter interactions strongly 
depend on the SoP, so its complete mea- 
surement is of paramount importance in a 
wide array of disciplines including chemis- 
try, imaging, optical communications, and 
astronomy. However, measuring the SoP of 
a light beam, the main goal of polarimetry, 
is much trickier than knowing its intensity 
or frequency, because it involves the simul- 
taneous measurement of the four Stokes 
parameters, which even account for the 
case of unpolarized light. 
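
As a hedged illustration of what the four Stokes parameters encode, the sketch below uses the standard textbook definition in terms of six analyzer intensities (horizontal, vertical, diagonal, antidiagonal, right circular, left circular). The function names are hypothetical, and the sign of S3 follows one common convention (right minus left); other texts use the opposite sign.

```python
import math


def stokes_from_intensities(i_h, i_v, i_d, i_a, i_r, i_l):
    """Return (S0, S1, S2, S3) from six analyzer intensities.

    i_h, i_v: intensity behind horizontal / vertical linear analyzers
    i_d, i_a: intensity behind +45 deg / -45 deg linear analyzers
    i_r, i_l: intensity behind right / left circular analyzers
    """
    s0 = i_h + i_v  # total intensity
    s1 = i_h - i_v  # horizontal vs. vertical preference
    s2 = i_d - i_a  # diagonal vs. antidiagonal preference
    s3 = i_r - i_l  # right vs. left circular preference (assumed sign convention)
    return (s0, s1, s2, s3)


def degree_of_polarization(stokes):
    """Fraction of the beam that is polarized: 1 = fully, 0 = unpolarized."""
    s0, s1, s2, s3 = stokes
    if s0 <= 0:
        raise ValueError("total intensity S0 must be positive")
    return math.sqrt(s1 * s1 + s2 * s2 + s3 * s3) / s0


# Fully horizontally polarized beam of unit intensity.
horizontal = stokes_from_intensities(1.0, 0.0, 0.5, 0.5, 0.5, 0.5)

# Unpolarized beam: every analyzer passes half of the light.
unpolarized = stokes_from_intensities(0.5, 0.5, 0.5, 0.5, 0.5, 0.5)
```

The unpolarized case shows why intensity alone is insufficient: both beams have the same S0, and only the remaining three parameters distinguish them.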

For decades, polarimeters have con- 
sisted of a combination of linear retarders, 
polarizers, and quarter-wave plates, which 
were able to obtain the four required measurements
by spatial or temporal splitting
of the incoming light beam (1). Such
macroscopic polarimeters, widely used in 
many applications, are complex, bulky, and 
expensive; there have been few attempts to 
miniaturize them, with the notable excep- 
tion of the fiber-grating polarimeter, highly 
useful in fiber optics (2). Recent advances 
in nanoscience and nanotechnology have 
unveiled new ways to shrink polarimeters, 
with all of the subsequent advantages that 
miniaturization and on-chip integration 
may bring (3). 

Because the optical response of nano- 
structures depends on the polarization of 
the incident light beam, it should be pos- 
sible to retrieve the SoP from outgoing 
signals. Possible polarization states are lin- 
early polarized light along the horizontal 
(L_x) or vertical (L_y) direction, and right-handed
(σ+) and left-handed
(σ−) circularly polarized light.
Then, a metasurface (an ar- 
ray of nanostructures) can 
be designed to scatter such 
polarization states into differ- 
ent directions (see the figure, 
left), which enables the re- 
trieval of the Stokes param- 
eters at a certain wavelength 
(4-8). The scattering paths 
could even be parallel to the 
metasurface and guided on 
the chip substrate to facilitate 
on-chip processing and detec- 
tion (5). 
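
The retrieval step can be sketched as a small linear inversion: if each of four output channels responds linearly to the incident Stokes vector, the detected powers are I = A·S, and S follows by solving that system. The matrix below is an assumption chosen for illustration (ideal analyzers for horizontal, vertical, diagonal, and right-circular light); a real device would need its matrix measured by calibration, and all names here are hypothetical.

```python
# Hypothetical instrument matrix: row k gives the power detected in channel k
# as 0.5 * (S0 + a1*S1 + a2*S2 + a3*S3) for an ideal analyzer (a1, a2, a3).
INSTRUMENT_MATRIX = [
    [0.5, 0.5, 0.0, 0.0],   # horizontal channel
    [0.5, -0.5, 0.0, 0.0],  # vertical channel
    [0.5, 0.0, 0.5, 0.0],   # diagonal channel
    [0.5, 0.0, 0.0, 0.5],   # right-circular channel
]


def forward_outputs(stokes, matrix=INSTRUMENT_MATRIX):
    """Simulate the four detected powers for a given Stokes vector."""
    return [sum(a * s for a, s in zip(row, stokes)) for row in matrix]


def solve_linear(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are not modified.
    m = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("instrument matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= factor * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]


def retrieve_stokes(outputs, matrix=INSTRUMENT_MATRIX):
    """Recover (S0, S1, S2, S3) from four measured channel powers."""
    return solve_linear(matrix, outputs)


# Round trip: a partially polarized test beam is recovered from its outputs.
test_beam = [1.0, 0.3, -0.2, 0.5]
recovered = retrieve_stokes(forward_outputs(test_beam))
```

Four independent channels are the minimum because the matrix must be invertible; adding more channels would turn the solve into a least-squares fit, which this sketch does not cover.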

However, the light-matter 
interaction enabling the 
polarization-dependent __re- 
sponse in these implementa- 
tions is distributed among 
all of the elements of the ar- 
ray, so the device footprint is 
much larger than one square 
wavelength. Local nanoscale 
measurement of the SoP requires polar- 
ization-dependent photonic responses 
in individual metallic or dielectric nano- 
structures. For instance, plasmonic nano- 
structures could be engineered in certain 
in-plane shapes that produce optical hot 
spots depending on the SoP of the illumi- 
nation (see the figure, middle). An absorb- 
ing semiconductor placed in the hot-spot 
regions can generate output photocurrents 
proportional to the SoP (9). Although the 
device footprint is much smaller than in 
the case of metasurface po- 
larimeters, this approach still 
requires at least four nano- 
structures to fully retrieve the 


A 


d 


“On-chip 


Nanoscale polarimeters 
Transverse light, with an electric field E, wave vector k, and wavelength A, illuminates a set of nanostructures. In all cases, 

measuring at least four outputs enables the retrieval of the state of polarization. Dielectric and metallic nanostructures are 
depicted in blue and yellow, respectively. 


Scattered output 
metasurface consisting of a set 
of nanoantennas scatters 
ifferent polarization states into 
well-defined spatial pathways. 


all the information needed to retrieve the 
SoP (see the figure, right) (72). In princi- 
ple, there is no limit for downscaling the 
size of the scatterer as long as it partly 
scatters the incoming beam. Remarkably, 
spin-orbit interactions would also enable a 
full-vector description of the incident light, 
beyond the transverse picture (73). 

By a suitable design (materials, shape, 
and size) of the underlying nanostruc- 
tures, the previous approaches can be 
used to build ultracompact polarimeters 
across the entire electromag- 
netic spectrum, even into the 
terahertz regime. Such polar- 
imeters would be extremely 


SoP. Thus, this polarimeter polarimeters broadband; by calibrating the 
would efficiently work for e polarization scattering paths 
transverse light beams but should easily or absorption at each work- 
ae ari Hesges displace their oie len br ee 

e SoP of complex, struc- e used for spectropolarim- 
tured beams that have varia- bulk fr ‘ee-space etry (6, 12). Notably, they may 
a "i the local rina counterparts also . sae aha with 
in the transverse plane ; resonan ielectric nano- 

The SoP at a single spatial because of structures (/4), which would 
oe teihee: ee 
mined by mixing both previ- advantages of tate mass manufacturing in 
ous approaches—that is, by integration...” silicon chips. Although the 


measuring the polarization- 
dependent scattering from 
individual nanostructures. As a result of 
spin-orbit interaction (17), the direction 
of the scattered light paths depends on 
the incident polarization. Moreover, the 
scattering paths can be defined with litho- 
graphically etched waveguides that sup- 
port different guided modes that will carry 
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absorptive approach is highly 
appropriate to build a device 
for SoP measurement that blocks beam 
propagation, the approaches based on 
scattering may operate in a nondestruc- 
tive way with low insertion losses and are 
highly suitable for an in-line configuration. 

On-chip polarimeters should easily dis- 
place their bulk free-space counterparts 
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Absorbed output 
Plasmonic nanoantennas 
can be fabricated in shapes 
designed 
of a certain polarization. 


Waveguide | 
mode 1 


Guided radiation 


Spin-orbit interactions help 
scatter different input 
polarizations into different directions 
and modes of a waveguide. 


to absorb light 


because of the inherent advantages of 
integration, such as low cost, reliability, 
repeatability, and integration with elec- 
tronics (8). Several applications of on-chip 
polarimeters could immediately open up. 
Low-cost in-line polarimeters with low in- 
sertion losses operating at telecommunica- 
tion wavelengths could monitor SoP in real 
time in present and future optical commu- 
nication networks that use polarization 
multiplexing schemes. Spin-orbit polar- 
imeters with near-atomic dimensions may 
locally test the polarization of single pho- 
tons in quantum systems and networks. 
Now that the fundamentals for on-chip na- 
noscale polarimeters have been settled, it 
is time to make them a practical reality.

RETROSPECTIVE


Paul G. Allen (1953-2018) 


Cofounder of Microsoft, visionary, and philanthropist 


By Rick Horwitz, Allan Jones,
Tom Daniel


On 15 October, Paul G. Allen died after a
9-year battle with non-Hodgkin’s lymphoma.
He was 65. Often dressed in a
timeless blue shirt and dark pants, the
cofounder of Microsoft and investor
in myriad domains didn’t stand out
by appearance; in conversation, he was soft- 
spoken and reserved. What stood out 
about Paul was his insatiable curiosity, vi- 
sion, breadth of knowledge, and generosity. 
Throughout his life, he focused on identifying 
big problems and making a difference. His 
push to move us out of our comfort zone and 
focus on scientific horizons appearing just 
beyond reach will be greatly missed. 

Paul was born in 1953, in Seattle, Wash- 
ington. His father was associate director of 
the University of Washington Libraries. He 
attended Lakeside School where he met Bill 
Gates, who was 2 years younger. The two 
became passionate about computing, which 
was done on the large mainframes that were 
seen as the future of the field. Paul took his 
perfect SAT score to Washington State Uni- 
versity for college but dropped out after 
2 years and moved to Boston to work near 
Bill, who was attending Harvard. In 1974, the 
cover of an upcoming issue of Popular Elec- 
tronics showed the Altair 8800, touted as the 
world’s first microcomputer. Paul rushed over 
to show it to Bill and quickly envisioned that 
this could lead to a computer on every desk. 
Struck by the potential opportunities, Bill 
dropped out of school as well. Whereas most 
of the interest at the time was in improving 
the hardware, Paul and Bill saw the need for 
software that would democratize computing, 
making the computer accessible to every- 
one and allowing it to play a pervasive role 
in writing, communication, and accounting. 
They launched Microsoft in 1975, and in 1981 
their operating system was adapted by IBM 
for its new personal computer. 

In 1983, Paul was diagnosed with Hodg- 
kin’s lymphoma and retired from Microsoft. 
He formed Vulcan, Inc., in 1986. The privately 
held company oversaw his wide swath of 
business and philanthropic activities, which 
reflected his passions. These included pop
culture (Museum of Pop Culture, or MoPOP),
music (Upstream Music Fest), art (Seattle 
Art Fair), sports (Seattle Seahawks, Portland 
Trail Blazers, and Seattle Sounders FC), mov- 
ies (Cinerama), film (Vulcan Productions), 
airplanes (Stratolaunch and Flying Heritage 
Collection), the environment, and climate 
and conservation. All of these endeavors were 
marked by Paul’s desire to do it differently 
and make a big impact. 

Paul also funded a multitude of scientific 
initiatives, including the Allen Institute for 
Brain Science, Allen Institute for Cell Science, 
the Allen Distinguished Investigators, the Al- 
len Institute for Artificial Intelligence, and 
the Paul G. Allen Frontiers Group, along with 
diverse workshops. All such ventures focused
on fundamental questions about the nature
of computations and the inherent complex- 
ity of living systems. The guiding principles 
of his initiatives included team science and 
open science. The Allen institutes diverged 
from the typical academic university or insti- 
tute, practicing industrial-scale science with 
clearly specified objectives and time lines. 
In the early 2000s, for example, the Allen 
Institute for Brain Science introduced large- 
scale team science to create an atlas of gene 
expression in the mouse brain. The atlas was 
presented in an easily accessible and useful 
format, providing a unique resource for the 
neuroscience community. 

Paul had a passion for science; had life 
played out differently, he might have been 
a professor at a university. He was always 
pushing for “big ideas” and “ways in.” He was 
deeply interested in unlocking the mysteries
of wildly complex processes, pressing us
to address the “unsolved mysteries” and find 
the “codes” in biological sciences. His inter- 
ests spanned from brains to cells, evolution 
to artificial intelligence, and elephants to 
oceans, areas in which he took both an intel- 
lectual and personal interest. 

Paul was most animated when a small 
group of colleagues was circling around a 
new idea, something he had spurred as part 
of his uncanny ability to question dogma. A 
visit to his office often ended up at the white- 
board, which we would fill with diagrams of 
cellular processes. He was right there at the 
board with us, drawing ideas and pacing. 
And there on the windowsill sat elephant 
statues and awards for discoveries and films, 
reminding us all of the vast range of concerns 
he had for living things and for science. 

His style was to engage small groups of ex- 
perts in charrettes (multiday planning meet- 
ings), listening intently to the arguments for 
and against various ideas and approaches. 
Once a topic was chosen and a plan devel- 
oped, it would be presented to Paul, who 
then asked the hard questions about feasibil- 
ity, strategic advantage, risks, and alternative 
approaches. He entered these meetings well 
prepared: Rather than have us simply reca- 
pitulate the plan, he went right to the few key 
ideas, asking for embellishment and clarity. 
He would ask “How will we know if we're 
successful?” and “Is this the right time?” His 
approach drove a discipline of thought and 
focus that spurred the success of his many 
endeavors. In these meetings, Paul showed 
his enormous intellectual breadth and cre- 
ativity as he looked for new ways to think 
about the problem, conjoin disciplines, and 
penetrate deeply into the topic. 

Although we only saw Paul occasionally, he 
emailed often, at nearly any time of the day 
or night. He sent articles he had read or asked 
about talks he had heard. The emails came 
with questions about our thoughts, how we 
could make a difference, what the roadblocks 
would be, or how much it would cost. 

It is comforting to know that what Paul 
started will live on through the research 
and discovery enterprises he fostered and 
through the people in whom he invested. 
We wish we had had more time with him; he 
never shied away from pushing us to think 
big and tackle unsolved mysteries in science. 
Paul always reminded us, as he wrote in his 
1 April 2016 Editorial in Science, that “all of 
us—philanthropists, governments, univer- 
sities, and private companies alike—must 
invest much more in basic, fundamental sci- 
ence and in the intrepid scientists who are 
willing to pursue out-of-the-box approaches 
at the very edges of knowledge.” 


10.1126/science.aav8254

Preventing chemical weapons
as sciences converge 


Focus must extend beyond 20th-century technologies 


By Michael Crowley, Lijun Shang, 
and Malcolm Dando 


Stark illustrations of the dangers from
chemical weapons can be seen in attacks
using toxic industrial chemicals
and sarin against civilians and combatants
in Syria and toxic industrial
chemicals in Iraq, as well as more
targeted assassination operations in Malay- 
sia and the United Kingdom, employing VX 
and novichok nerve agents, respectively. Con- 
cerns about such malign applications 
of chemical technology are exacer- 
bated by the unstable international 
security environment and the chang- 
ing nature of armed conflict, “where 
borderlines between war, civil war, 
large-scale violations of human 
rights, revolutions and uprisings, in- 
surgencies and terrorism as well as 
organized crime are blurred” (1). It
is thus essential that the global com- 
munity regularly review the nature 
and implications of developments in 
chemistry, and its convergence with 
the life and associated sciences, and 
establish appropriate measures to 
prevent their misuse. With the par- 
ties to the Chemical Weapons Con- 
vention (CWC) convening a Review 
Conference to address such issues beginning 
21 November 2018, we highlight important 
scientific aspects (2). 


COMPREHENSIVE PROHIBITION 

The CWC is a multilateral treaty in effect 
since 1997 that proscribes the development, 
production, stockpiling, transfer, and use 
of chemical weapons “under any circum- 
stances” and requires their destruction 
within a specified time period. The CWC al- 
lows the use of toxic chemicals for a range 
of industrial, agricultural, research, medical, 
pharmaceutical, or other peaceful purposes, 
including law enforcement, as long as the 
“types and quantities” of chemicals employed 
are “consistent with such purposes.” The
CWC has declaration requirements (obliging
States to detail facilities that produce or use 
certain chemicals of concern, grouped into 
three “schedules,” based on the risk they pose 
to the CWC), industry inspections, and other 
verification measures to ensure that toxic 
chemicals and related technologies are not 
misused in weapons production and to inves- 
tigate alleged chemical weapons use. 

The Organisation for the Prohibition of
Chemical Weapons (OPCW), which is the
implementing body of the CWC, comprises the
193 States Parties and a Technical Secretariat
that provides technical assistance to States,
routinely inspects relevant State and
commercial industrial facilities, and monitors
activities to ensure compliance. It was awarded
the Nobel Peace Prize in 2013 for overseeing
and facilitating the verified destruction
of most of the declared chemical weapons
stocks produced in the last century—to date
totaling more than 96% (69,750 metric tons)
of the declared stockpiles of chemical agents.

Russian special forces remove hostages from a Moscow theatre following
use of aerosolized anaesthetics to end an armed siege in October 2002;
125 of the 900 hostages died as a result of the chemicals employed.

Now that this first phase of the CWC’s 
implementation is nearing completion, the 
OPCW has to increasingly focus on prevent- 
ing the reemergence of chemical weapons. 
Thus, in addition to combatting possession 
and employment of 20th-century chemical 
weapons types, the OPCW must also address 
a changing external environment where the 
risks associated with toxic chemicals and
their potential misuse as weapons are becoming
more diffuse and less well defined in
terms of chemical compounds or dissemina- 
tion methods. To respond effectively to such 
shifts and maintain the comprehensive na- 
ture of the chemical weapons prohibition, 
the OPCW should prioritize key issues below. 


Ensure effective implementation of the 
General Purpose Criterion 

Although the CWC includes three schedules 
of toxic chemicals for the application of
verification measures, the scope of the CWC is
defined not by these schedules but by
its General Purpose Criterion (GPC), which
prohibits misuse of toxic chemicals based 
on intent rather than on this limited list of 
chemicals (3). The CWC negotiators sought 
to ensure that the CWC could accommodate, 
and the States Parties be able to respond to, 
future developments in chemistry, biology, 
and associated sciences and technologies. 
Consequently, “even toxic chemicals whose 
existence is not yet known are covered,” 
while “legitimate uses of all toxic chemicals 
and chemicals from which they can 
be made” are protected (4). 

But there is contested interpreta- 
tion of the GPC as to the range of 
chemicals and delivery mechanisms 
that could be employed for law en- 
forcement, and the nature of what 
constitutes legitimate use. The up- 
coming Review Conference should 
thus establish an Open-Ended Work- 
ing Group (OEWG) involving sci- 
entific experts to design guidelines 
to prevent research, development, 
production, and employment activi- 
ties that, while purportedly intended 
to support law enforcement, would 
undermine the prohibitions of the 
CWC. The OEWG should consider 
existing obligations under interna- 
tional law, notably, international human 
rights law (IHRL), and their bearing on the 
CWC. The OEWG should specifically address 
the following: 

1) Riot control agents (RCAs). The CWC 
defines RCAs—such as tear gas and pepper 
spray—as “any chemical not listed” in one of 
its three schedules that can produce “rapidly 
in humans sensory irritation or disabling 
physical effects which disappear within a 
short time following termination of exposure.” 
Thus, chemicals should only be employed as 
RCAs if there is solid scientific evidence to 
show that such agents are not dangerous to 
humans when used in an appropriate man- 
ner. The CWC permits their use for “law en- 
forcement including domestic riot control” 
(5), as long as the “types and quantities” (6) 
are consistent with such purposes. However, 
a recurring concern documented by the medical
community and human rights monitors
has been the widespread misuse of RCAs by 
police and security forces in excessive quan- 
tities, including in hospitals, prisons, homes, 
and automobiles, where targeted individuals 
cannot disperse. In such situations, serious 
injury or death can result from toxic proper- 
ties of chemicals or from asphyxiation. 

Although a variety of chemicals were de- 
veloped, considered, or used as RCAs in the 
past century, the OPCW Scientific Advisory 
Board (SAB, composed of independent experts)
clarified that only 17 chemicals from
the 60 it examined were consistent with the 
RCA definition under the CWC (7). For exam- 
ple, certain States designated Adamsite (DM) 
as an RCA, but it has been removed from 
this category because of its danger to human 
health [(7), appendix 3]. The OEWG should 
now clarify the nature and scope of “law en- 
forcement” activities and develop guidance 
as to “types and quantities” of RCAs that can 
legitimately be used in such circumstances, 
highlighting obligations under IHRL. 

2) Delivery systems. If the OPCW does 
not take appropriate action on RCAs, the 
situation could dramatically worsen as a re- 
sult of ongoing development and marketing 
of systems capable of delivering far greater 
amounts of RCAs (and potentially other toxic 
chemicals) over wider areas or more extended 
distances than current standard law enforce- 
ment delivery mechanisms, such as hand- 
held sprays, grenades, and single launched 
projectiles. Such new systems include large- 
capacity spraying devices, automatic grenade 
launchers, multibarrel projectile launchers, 
large-caliber RCA projectiles, and unmanned 
ground or aerial vehicles capable of carry- 
ing spraying devices or projectile launchers 
(8). The OEWG should develop criteria for 
determining which means of delivering and 
dispersing RCAs are inconsistent with the
purpose of law enforcement and should thus 
be prohibited. Such prohibited means of de- 
livery should, at a minimum, include artillery 
shells, aerial bombs, mortar shells, and clus- 
ter munitions. 

3) Incapacitating chemical agent (ICA) 
weapons. Although the CWC permits use of 
appropriate types and quantities of RCAs 
for law enforcement, certain countries have 
conducted research into weapons employ- 
ing other distinct toxic chemicals, so-called 
ICAs. Not separately defined under the CWC, 
ICAs can be considered as a range of toxic 
chemicals—only one of which [3-quinuclidi- 
nyl benzilate (BZ) and two of its immediate 
precursors] is currently scheduled—includ- 
ing anesthetics and other pharmaceutical 
chemicals that are purportedly intended to 
act on the body’s core biochemical and physi- 
ological systems, notably the central nervous 
system (CNS), to cause prolonged but
nonpermanent disability. Such CNS-acting chemicals
can produce unconsciousness, sedation,
hallucination, incoherence, disorientation, 
or paralysis. With inappropriate doses, how- 
ever, death can result. Leading medical and 
scientific organizations have highlighted 
grave dangers to health and well-being of 
such weapons (9); in the only confirmed ex- 
ample of their large-scale use, an aerosolized 
mixture of two anesthetics—carfentanil and 
remifentanil—employed by Russian security 
forces to end the Moscow theatre siege of 
October 2002 caused the deaths of 125 of the 
900 hostages (10). 

As the U.S. Ambassador to the OPCW
noted in October of this year, “The United
States and many other States Parties are
seriously concerned that some States may be
developing these chemicals for warfare...while
cloaking their efforts as legitimate activities
such as law enforcement” (11). The U.S.
concern is reflected in a recent Department of
Defense solicitation that “seeks to develop
field diagnostic capabilities for detection of
exposure to the ever-growing opioid class of
chemical threat agents.”

The chemical threat spectrum includes
bioregulators and toxins, and our increasing
understanding of the CNS is likely to
uncover many more potential targets and agent
classes that might be weaponized. Scientists
should be aware of such possibilities and be
able to alert their CWC National Authority
and the OPCW Technical Secretariat to
potential dangers. The OEWG could determine
either that development, stockpiling,
transfer, and use of ICA weapons for law
enforcement are prohibited under the CWC or that
such actions are permitted but should be
severely restricted.

A Syrian man mourns children killed in a chemical attack on the town of Douma, Syria, in April 2018. Advances in
science and technology could aid OPCW investigations into such attacks and help uncover those responsible.
PHOTO: MOHAMMED HASSAN/UPI/NEWSCOM


Improve OPCW monitoring and risk assessment
of science and technology

In 2011, an expert panel recommended that
the OPCW should “improve and widen the
scope of monitoring and evaluating developments
in chemical science and technology”
(12). In 2013, in response to such concerns,
the OPCW appointed a Science Policy
Adviser at the Technical Secretariat. In addition,
over the past 5 years, the SAB has regularly
provided technical reports and briefings on
key scientific and technological (S&T)
developments. Building upon these advances, and
informed by SAB recommendations (13), the
OPCW should consider further measures
to strengthen the Technical Secretariat’s
capability to monitor and forecast S&T
developments and their implications, and to
strengthen its ability and mandate to
proactively bring specific cases of concern to the
attention of the States Parties. However, this is
not something that the OPCW can do alone,
given the range of scientific disciplines and
technologies that need to be monitored, their 
complexity, their rapidity of advance, and the 
geographical scope of research and develop- 
ment. Thus the nongovernmental chemical 
and life scientific community, in particular, 
has an important role to play by undertak- 
ing technology tracking of generic trends in 
technologies of relevance to the CWC (as has 
been undertaken previously by the Interna- 
tional Union of Pure and Applied Chemistry 
in preparation for previous CWC Review 
Conferences), and by undertaking targeted 
research into S&T developments of particular 
concern, for example, in the fields of medici- 
nal chemistry, pharmacology, synthetic biol- 
ogy, nanotechnology, and, as undertaken by 
the Royal Society, neuroscience (9). 


IMPLEMENTATION AND VERIFICATION 
Advances in S&T may have several effects on 
national implementation of the CWC by its 
States Parties and on how the OPCW verifica- 
tion mechanisms function, which the Review 
Conference should address. 


Update industry verification measures 
OPCW verification measures currently focus 
on the list of scheduled chemicals, which were 
previously identified from past State chemi- 
cal weapons programs. But new production 
pathways to old chemical warfare agents may 
become feasible as a result of technological 
advances; alternatively, new potential chemi- 
cal warfare agent types may become relevant 
involving toxic chemicals not listed on any 
of the schedules. Consequently, the routine 
industry verification regime (as well as the 
analytical methods and databases available 
for challenge inspection and for investigation
of alleged chemical weapons use) needs
to be adapted to these new technological and 
chemical realities. The SAB has suggested 
that “efforts to ensure that the verification 
regime remains effective would benefit from 
more extensive engagement with techni- 
cal experts from industry, and review of 
industry-focused research and development, 
including the driving forces for adoption of 
new technologies into industrial processes” 
(13). Favorable consideration should also be
given to updating the schedules themselves 
(13), at least to provide indicators of the 
new or additional types of potential chemi- 
cal agents (and their precursors) of concern, 
such as the novichok agent (and its
associated families).


Other chemical production facilities 

Other chemical production facilities (OCPFs) 
are chemical plants that do not currently 
produce, but are capable of manufacturing, 
chemical warfare agents or precursors. At 
present, a small fraction of declared OCPFs 
are selected for verification by the OPCW;
the Review Conference should consider
authorizing a substantial increase in OCPF
inspections per year. The OPCW should also be
directed to refine the process of site selection 
so as to target inspections on multipurpose 
chemical plants that pose the greatest risk of 
being utilized for prohibited purposes. 


Biological and biologically mediated 
processes for production of discrete 
organic chemicals 

Some products and processes used by the 
biomanufacturing industry are as relevant to 
the CWC as those used by other OCPF facili- 
ties, including those the Technical Secretariat 
considers pose notable risks. Thus, the SAB 
has consistently recommended that bioman- 
ufacturing of chemical products should be 
covered under the scope of the CWC. How- 
ever, States Parties have yet to agree on how 
to treat these types of production processes 
and facilities. The Review Conference should 
follow SAB advice and establish measures to 
determine the relevance of various types of 
biomanufacturing processes and facilities for 
CWC verification purposes. 


PREVENTING AND RESPONDING 

The OPCW should continue improvements 
in the operational and technical capac- 
ity of the Technical Secretariat to conduct 
challenge inspections and investigations 
of alleged use of chemical weapons, with 
an increased focus on chemical forensics. 
An important development in this regard 
was the June 2018 decision to empower the 
OPCW to develop an attribution mechanism 
to determine who conducted a chemical at- 
tack (14). Ongoing work by the SAB into op- 
portunities and difficulties associated with 
chemical forensics will enable the OPCW to 
most effectively utilize new tools and methods.
Such work, which would benefit from
wide consultation with scientific experts, 
must be complemented by efforts to com- 
pile, expand, and properly curate the data- 
bases of reference spectra and collections of 
reference materials that will be needed for 
such forensic analysis. 

The SAB highlighted the critical role of 
biomedical samples in investigations of al- 
leged use of toxic chemicals and recom- 
mended that the Technical Secretariat should 
“actively encourage further research on po- 
tential markers of exposure to such chemi- 
cals.” The OPCW should also build on the 
considerable progress made toward devel- 
oping a network of designated laboratories 
for the analysis of biomedical and biological 
samples (15). Advances in other fields could 
also facilitate more effective evidence collec- 
tion, for example, exploring the potential of 
unmanned aerial vehicles to support
reconnaissance, detection, and chain of custody.

The OPCW should also consider how best
to strengthen the resilience of States against 
hostile use of toxic chemicals. This could 
include expanding the number of viable 
national protective programs supported by 
OPCW training and capacity building, and 
the establishment of well-equipped regional 
capacities for effective response to the use of 
chemical weapons or the accidental release of 
toxic chemicals. 


SCIENTISTS AND AN EVOLVING OPCW 

The OPCW faces the task of deciding how 
best to evolve to prevent the reemergence of 
chemical weapons in a period of rapid scien- 
tific change and unstable international secu- 
rity. There is growing recognition within the 
OPCW of the vital importance of engaging 
with and ensuring the support of the world- 
wide scientific community, in particular via 
relationships with professional societies (13). 
Chemical and life scientists could play their 
part by being better informed of the issues at 
stake, and by ensuring that their colleagues 
and students are alerted to the dangers of 
the misuse of dual-use technologies and are 
implementing relevant ethical codes, codes 
of conduct, and the Hague Ethical Guidelines 
recently developed by the OPCW to promote 
a culture of responsible conduct in the chem- 
ical sciences and to guard against the misuse 
of chemistry for malign intent.

Approaching the singularity


New data, old rivalries, and enduring questions fill a 
welcome overview of black hole research 


By Jeremy Schnittman 


What happens at the black hole
event horizon, where time stands
still and intuition breaks down?
In the opening chapter of his
new book, Einstein’s Monsters,
astronomer and popular science
writer Chris Impey puts the problem suc- 
cinctly: “It made no sense for a 
physical object to have zero size 
and infinite mass density. Ein- 
stein’s theory had created some- 
thing monstrous.” 

Einstein’s Monsters
The Life and Times of Black Holes
Chris Impey

Impey does an admirable job
describing multiple facets of the
often contradictory field of black
hole astrophysics, including its
history, science, and colorful
human interactions. For example, we
now know that most astrophysical
black holes are notoriously difficult
to detect and observe, and yet
some are the brightest x-ray sources in the
sky and can be seen from across the entire
universe. They seem to come in two sizes:
small and extra-extra-extra-large. The small
ones have huge densities, whereas the largest
ones are lighter than air.

Even the (seemingly) most fundamental
fact about black holes—that not even light can
escape one—is not entirely true. In the 1970s,
Stephen Hawking proved that black holes of
all sizes give off a faint glow of extremely
low-temperature blackbody radiation.

In the far distant future, every black hole
in the Universe will eventually evaporate,
leaving nothing behind but a dilute,
near-uniform bath of radio waves. A related
paradox: An astronaut plunging into a black
hole would feel nothing as she
slips quietly past the event horizon
and reaches the singularity
only a few moments later, yet to
her crew mates watching from
the safety of the nearby mothership,
it would take an eternity for
her to even reach the horizon.
Like Zeno’s tortoise, she would
seem to get closer and closer,
fainter and fainter, but never
quite disappear.

In addition to these physical
contradictions, the book also cov- 
ers many of the field’s more interesting hu- 
man conflicts and competitions throughout 
the past century: Eddington versus Chan- 
drasekhar, Zwicky versus Sandage, Thorne 
versus Hawking, even Einstein versus Ein- 
stein. Impey paints a colorful picture of the 
personalities involved, including personal 
anecdotes from his own firsthand interac- 
tions with many of the leading actors of 
the story. 

A computer-simulated image depicts a supermassive
black hole at the center of a galaxy.

One may reasonably ask, “Does the world
really need another popular science book
about black holes?” Anyone who has read
and enjoyed Kip Thorne’s gold standard, 
Black Holes and Time Warps, will learn 
relatively little from Einstein’s Monsters. 
Yet the intended target audience is more 
likely to have read Hawking’s A Brief His- 
tory of Time (if they have read anything on 
the topic) and thus may not be familiar with 
many of the critical observational discover- 
ies over the past half-century. 

For the next generation of popular as- 
tronomy buffs, Einstein’s Monsters is a rea- 
sonable entry point, covering a broad—if 
not particularly deep—range of theoretical 
and observational topics in black hole re- 
search. Particularly welcome, even for more 
experienced black hole aficionados, are the 
excellent chapters about the Laser Interfer- 
ometer Gravitational-Wave Observatory’s 
recent discovery of gravitational waves and 
the Event Horizon Telescope’s imminent 
discovery of black hole shadows. 

Impey’s broad and relatively cursory ap- 
proach to black holes mirrors his earlier 
works in popular astronomy, this being his 
seventh book in about as many years. Un- 
fortunately, this prolific productivity is occa- 
sionally betrayed by factual errors in the text, 
especially in the more theoretical passages. 
In chapter 8, for example, Impey claims that 
fermions and bosons cannot interact with 
each other, when in fact that is exactly how 
photons and gluons convey the fundamental 
forces of nature. Elsewhere, he asserts that 
Mercury’s precession is 5600 arc sec per cen- 
tury, which is off by an order of magnitude. A 
number of theoretical results—for example, 
using x-ray oscillations to measure black 
hole mass and spin—are also quoted as es- 
tablished fact despite widespread skepticism 
in the research community. 

Certain passages, and even chapters, of 
Einstein’s Monsters have a distinctly hap- 
hazard feel to them, throwing together 
a collection of topics without an obvious
theme. Chapter five jumps from γ-ray
bursts to intermediate-mass black holes to 
microquasars to numerical relativity and 
then cosmological N-body simulations. Per- 
haps the goal here is to impress the reader 
with how important black holes are in mod- 
ern astronomy, but it gives the impression 
of disorganization. 

Despite these few shortcomings, Ein- 
stein’s Monsters will be sure to capture the 
imagination of most who pick it up, simul- 
taneously convincing the reader that these 
monsters, while in fact quite certainly real, 
should be loved and not feared. 
COSMOLOGY


A physicist’s final reflections 


An unfinished tome reveals the late Stephen Hawking’s musings on life’s biggest mysteries 


By Andrew Robinson 


The death of cosmologist Stephen
Hawking earlier this year happened to 
fall on the birthday of Albert Einstein. 
This felt like an appropriate coinci- 
dence, given the centrality of Einstein’s 
general theory of relativity in Hawk- 
ing’s much-celebrated life as a scientist. 
Einstein is mentioned in Hawking’s post- 
humously published book—Brief Answers 
to the Big Questions, which he left unfin- 
ished—far more frequently than any other 
scientist, past or present, including Isaac 
Newton, Hawking’s illustrious predecessor 
as Lucasian professor of mathematics at the 
University of Cambridge. Indeed, the con- 
cluding “big question” of the 10 explored 
in 10 chapters, “How do 
we shape the future?” be- 
gins with Einstein. “Where 
did his ingenious ideas 
come from?” asks Hawk- 
ing. He answers, “A blend 
of qualities, perhaps: intu- 
ition, originality, brilliance. 
Einstein had the ability to 
look beyond the surface to 
reveal the underlying struc- 
ture. He was undaunted 
by common sense, the idea 
that things must be the 
way they seemed. He had 
the courage to pursue ideas 
that seemed absurd to oth- 
ers. And this set him free 
to be ingenious, a genius of 
his time and every other.” 
Was Hawking a genius, 
too? He never won a Nobel Prize, and the 
book gives no indication that Hawking re- 
garded himself as a genius. On the other 
hand, he was one of the very few scien- 
tists since Einstein to become a household 
name. As his close collaborator, Nobel lau- 
reate Kip Thorne, remarked in his eulogy: 
“Newton gave us answers. Hawking gave us 
questions. And Hawking’s questions them- 
selves keep on giving, generating break- 
throughs decades later. When ultimately 
we master the quantum gravity laws, and 
comprehend fully the birth of our universe,
it may largely be by standing on the shoul-
ders of Hawking.”

The reviewer is the author of Einstein: A Hundred Years of

Hawking was well known for two ad- 
ditional reasons unrelated to his mind- 
boggling cosmological theories. The first 
was his 1988 book, A Brief History of Time, 
an international bestseller that sought to 
explain the physics of time to the general 
reader without using mathematical equa- 
tions. The second was his courageous 
struggle with motor neuron disease, which 
rendered him wheelchair-bound and de- 
pendent on a computer screen and speech 
synthesizer to communicate. “[A]s some- 
one who at the age of twenty-one was told 
by their doctors that they had only five 
years to live, and who turned seventy-six 
in 2018, I am an expert on time in an-
other sense, a much more personal one,”
he writes. “I am uncomfortably, acutely
aware of the passage of time, and have
lived much of my life with a sense that the
time that I have been granted is, as they
say, borrowed.”

Hawking enjoys a moment of weightlessness in zero gravity in 2007.

Some of Hawking’s “big questions” and 
answers are firmly rooted in science—for 
example, “What is inside a black hole?” 
and “Is time travel possible?”—whereas 
others inherently cannot be, such as “Will 
we survive on Earth?” “Should we colonise 
space?” “Will artificial intelligence out- 
smart us?” and “Is there a God?” To the 
last question, he answers, “If you like, you 
can call the laws of science ‘God,’ but it
wouldn’t be a personal God that you would 
meet and put questions to. Although, if
there were such a God, I would like to ask 
however did he think of anything as com- 
plicated as M-theory in eleven dimensions.” 

Certain of Hawking’s assertions may be 
considered questionable and at times myo- 
pic. As an ardent advocate of space travel, 
he believes, “Not to leave planet Earth 
would be like castaways on a desert island 
not trying to escape.” Indeed, he goes much 
further and claims that space colonization 
is the only hope for the sur- 
vival of the human race af- 
ter the “almost inevitable” 
destruction of Earth, which 
he predicts will happen 
within the next 1000 years. 

Regarding artificial intel-
ligence (AI), he anticipates,
“AI may automate our jobs, 
to bring both great pros- 
perity and equality” in the 
medium term. Looking fur- 
ther ahead, he writes, “the 
future of communication is 
brain-computer interfaces. 
... If we can connect a hu- 
man brain to the internet 
it will have all of Wikipedia 
as its resource.” Perhaps it 
is not surprising that some- 
one who was intimately de- 
pendent on information technology should 
have held such an opinion about its future. 
Although Hawking acknowledges poten- 
tial negative scenarios, referring to HAL, 
the highly intelligent computer in the film 
2001: A Space Odyssey that unsuccessfully 
attempts to destroy its human masters, he 
remarks merely, “but that was fiction. We 
deal with fact.” 

Nonetheless, the final testament of this 
unique scientist is well worth reading. 
One cannot help but be moved by Hawk- 
ing’s lifelong struggle to lead a creative life. 
“[R]emember to look up at the stars and 
not down at your feet,” he sums up.


10.1126/science.aav7499 
Climate change drives
tree mortality 


In their Report “Classifying drivers of 
global forest loss” (14 September, p. 
1108), P. G. Curtis et al. reported a global 
assessment of forest loss from 2001 to 
2015. They attributed 99% of the loss to 
land-use change and wildfire, and they 
urge companies to eliminate 5 million 
hectares of land-use change per year to 
prevent further deforestation. Their analy- 
sis underestimates climate change-driven 
drought, storms, and insect epidemics, 
which also contribute to substantial tree 
mortality each year. 

During the period Curtis et al.
investigated (2001 to 2015), climate 
change-driven forest loss increased, 
affecting large forest areas across the 
globe (1). Growing evidence shows that 
increasingly hot droughts killed most 
of the trees in an area of 1.2 million hec- 
tares in southwest North America (2), 
and warming-driven mountain pine beetle 
outbreaks affected forests in northwest 
North America at a rate of 6 to 7 mil- 
lion hectares per year between 2005 and 
2008 (3). Additional forest losses of large 
magnitude due to drought, ice, snow, and 
wind storms occurred in China, Spain, 
Chile, and other countries (1). The order
of magnitude of these forest loss events is
similar to the estimated deforestation rate 
of 5 million hectares a year reported by 
Curtis et al. 

One potential cause of the substantial 
underestimation of climate change-driven 
forest loss is our limited ability to detect 
it with remote sensing tools (4), which 
Curtis et al. used for their analysis. Unlike 
most land-use change forest losses and 
wildfire, climate change-driven tree 
mortality is often diffuse and gradual. 
Although measurements on the ground 
are sufficient to infer that climate 
change has caused tree loss [e.g., (1-3)],
algorithms and analysis tools that rely 
on remote sensing data are still under 
development. In some cases, Curtis et
al.’s analysis places climate change-
induced changes in other categories. For
example, they attributed both mountain 
pine beetle damage in Canada (3) and 
drought-induced tree mortality near Los 
Alamos, New Mexico (2), and Beetle Rock, 
California (1), to forestry. 

The analysis presented by Curtis et al. pro- 
vides only a partial view of the magnitude 
and causality of global forest loss. Ground- 
based data from national forest inventories 
and research plot networks, combined with 
improved remote sensing image analysis, are 
essential to identify diffuse forest losses due 
to climate change. Given that global tem- 
peratures continue to rise and droughts are 
expected to occur more frequently and with
higher severity (5), quantifying and monitor-
ing these forest losses could potentially
become even more important than control-
ling man-made deforestation.

Published by AAAS

Halt speculation on
U.S. embassy in Cuba 


Since 2016, the U.S. State Department has 
reported that 25 staff at the U.S. embassy in 
Havana have complained of symptoms such 
as hearing loss and vertigo (1). Embassy 
staff have reported hearing unusual and 
unsettling sounds at home or in hotel
rooms in addition to their workplaces at 
the embassy (2). In the ensuing 2 years,
scientists have allowed speculation about
the causes of these health issues to outpace
the evidence.

Pine beetle infestations driven by climate
change can lead to deforestation.

Neuroscientists at the University of 
Pennsylvania (the affiliation of K.R.F., who 
was not involved in the study) reported evi- 
dence for brain injury related to perceived 
sound in some of the affected individuals 
(2), but this work has been firmly contested 
by other scientists, some of whom advance 
other, more mundane explanations for 
the symptoms (3-5). An editorial in the 
neuroscience journal Cortex notes internal 
inconsistencies in the neuroscience evidence 
published so far (6). After nearly 2 years of 
investigation, neither Cuban nor U.S. offi- 
cials have identified the cause of the health 
problems or even provided convincing 
evidence that the diverse health problems 
reported by the staff have a common cause 
(7). While acoustic or electromagnetic fields 
might conceivably have produced audible 
sounds at the embassy, no physical agents 
have been reported at levels that might 
plausibly have injured the employees. 
Evidence available to the public remains 
largely anecdotal. 

Nevertheless, discussion by scientists and 
the media about the cause of the reported 
health problems has been characterized by 
speculation and unwarranted inferences 
about possible effects of physical agents 
supposedly directed at the employees [e.g., 
(8, 9)]. Such speculation is unhelpful in 
treating the affected individuals and hinders 
relations between the two countries. The 
affected members of the embassy staff need 
careful medical follow-up without presump- 
tions about the etiology of their problems, 
and the U.S. embassy needs a careful 
occupational health assessment to iden- 
tify any potential health risks. Alternative 
explanations (including preexisting diseases 
or stress-induced exacerbation of functional 
disorders) must not be discarded because 
they do not fit in preconceived theories. 
There is insufficient evidence to guess about 
the cause of the sounds, let alone assess 
their potential health relevance. We need to 
halt the speculation and instead encourage 
more science and careful medicine. 


Mitchell Joseph Valdés-Sosa¹ and
Standardizing return
of participant results 


As members of the National Academies 
of Sciences, Engineering, and Medicine 
committee that wrote the report on the 
return of individual research results (1), 
we reject the allegations in the Policy 
Forum “Return of results and data to 
study participants” (S. M. Wolf and B. J. 
Evans, 12 October, p. 159) that the report 
is paternalistic, misunderstands the law, 
burdens Institutional Review Boards 
(IRBs), and creates barriers to the return 
of results. 

In the National Academies report, we 
advocate regulatory changes to expand the 
opportunities to give research partici- 
pants access to their individual results. 
The Centers for Medicare and Medicaid 
Services (CMS) interprets the law govern- 
ing laboratory standards as prohibiting 
any communication about research 
results to participants unless the labora- 
tory is certified according to the Clinical 
Laboratory Improvement Amendments 
of 1988 (CLIA). Wolf and Evans contend 
that CMS does not have the statutory 
authority for this restriction. However, 
there is no consensus regarding this posi- 
tion (2, 3) and CMS’s interpretation has 
not been overruled by the courts. Given 
substantial penalties for noncompliance, 
many research institutions abide by CMS’s 
interpretation. Our report recommends an 
explicit change to the regulations to bring 
clarity to the field that will not be achieved 
by assuming that CMS’s position can be 
ignored, as Wolf and Evans suggest. 

We recommend that the Office for
Civil Rights (OCR) clarify what research 
results participants have a right to 
under the Health Insurance Portability 
and Accountability Act (HIPAA) rule by 
clearly defining the Designated Record 
Set (DRS) to include all research results 
generated in laboratories that meet an 
accepted quality standard. Although the 
DRS includes information maintained by 
a covered entity that could be used for 
individual decision-making (4), there is 
no consensus about what research data 
should be included. In the absence of 
guidance from OCR, some institutions are 
excluding some research data from the 
DRS (5). Our recommendation supports
broadening participant access to high- 
quality research results while adhering to 
the principle that results lacking dem- 
onstrated quality should not be used by 
participants or their health care providers 
for individual decision-making. 

We disagree with the notion that our 
recommendations are paternalistic. During 
our study, we consulted a diverse group of 
community members, study participants, 
and advocacy groups to fully understand 
how individuals might use results, as well 
as barriers to using and understanding 
results. The committee sought a balance 
that promotes broad access to results 
while addressing public expectations that 
results are accurate. Disclosing poor- 
quality results reflects bad science and 
does not respect participant autonomy 
or welfare. We maintain that quality 
standards for research laboratories will 
better ensure accurate results that meet 
the expectations of participants and will 
enhance the overall validity and reproduc- 
ibility of the research enterprise. 

We believe that IRBs are up to the 
challenge of addressing the new respon- 
sibilities recommended by our report, 
although we acknowledge that these 
demands cannot be addressed overnight. 
The report recommends that investigators 
work with stakeholders to develop plans 
on whether and how to disclose results 
as protocols are developed. The informed 
consent process is key to fostering par- 
ticipant understanding of their options 
for return of results and to documenting 
expression of their preferences. IRBs must 
be involved in evaluating the return of 
results plan and consent process and, over 
time, will need to develop expertise and 
policies for this purpose. 

We are confident that our recom- 
mendations break down many of the 
existing barriers to the return of indi- 
vidual research results and, if followed, 
will enhance the collaboration among all 
stakeholders. Return of individual results 
is not a common practice (6) despite 
existing guidelines, and research partici- 
pants rarely request results under their 
HIPAA access rights. Our report promotes 
the routine consideration of return of 
results by funders, researchers, and par- 
ticipants; the development of standards 
and policies to foster return, greater 
transparency, and engagement with 
participants; and an informed consent 
process that informs participants of their 
opportunities and rights. 
NEXTGEN VOICES: SUBMIT NOW


Transition challenges 


Add your voice to Science! Our new 
NextGen VOICES survey is now open: 


Have you ever encountered a par- 
ticularly stark difference between an 
old and new position in your educa- 
tion or career? For example, was it a 
challenge to transition from under- 
graduate to postgraduate studies, 
from postgraduate studies to a first 
job, or from one field or career path to 
another? If so, what was the differ- 
ence between the positions, and what 
advice would you give to someone 
making a similar transition? 


To submit, go to 
www.sciencemag.org/nextgen-voices 


Deadline for submissions is 23
November. A selection of the best
responses will be published in the
4 January 2019 issue of Science.
Submissions should be 150 words 
or less. Anonymous submissions will 
not be considered. 
Comment on “Predicting reaction
performance in C-N cross-coupling 
using machine learning” 


Kangway V. Chuang and Michael J. Keiser 
Ahneman et al. (Reports, 13 April 2018) 
applied machine learning models to predict 
C-N cross-coupling reaction yields. The 
models use atomic, electronic, and vibra- 
tional descriptors as input features. However, 
the experimental design is insufficient to dis- 
tinguish models trained on chemical features 
from those trained solely on random-valued 
features in retrospective and prospective test 
scenarios, thus failing classical controls in 
machine learning. 

Full text: dx.doi.org/10.1126/science.aat8603 


Response to Comment on “Predicting reac- 
tion performance in C-N cross-coupling 
using machine learning” 


Jesús G. Estrada, Derek T. Ahneman,
Robert P. Sheridan, Spencer D. Dreher, 
Abigail G. Doyle 

We demonstrate that the chemical-feature 
model described in our original paper is 
distinguishable from the nongeneralizable 
models introduced by Chuang and Keiser. 
Furthermore, the chemical-feature model 
significantly outperforms these models
in out-of-sample predictions, justifying the
use of chemical featurization from which 
machine learning models can extract 
meaningful patterns in the dataset, as 
originally described. 

Full text: dx.doi.org/10.1126/science.aat8763 
Comment on “Predicting reaction
performance in C-N cross-coupling 
using machine learning” 


Kangway V. Chuang and Michael J. Keiser* 


Ahneman et al. (Reports, 13 April 2018) applied machine learning models to predict 
C-N cross-coupling reaction yields. The models use atomic, electronic, and vibrational 
descriptors as input features. However, the experimental design is insufficient to 
distinguish models trained on chemical features from those trained solely on random-
valued features in retrospective and prospective test scenarios, thus failing classical
controls in machine learning.


Chuang et al., Science 362, eaat8603 (2018) 16 November 2018

A recent report by Ahneman et al. (1) describes a machine learning approach for
modeling chemical reactions with data collected through ultrahigh-throughput
experimentation. The Buchwald-Hartwig coupling (2) is used as a model reaction,
with a Glorius interference approach (3) to study reaction poisoning by isoxazole
additives. Reactions are represented by atomic, electronic, and vibrational
descriptors that are automatically calculated through a new computational
pipeline. The authors find that random forest models outperform linear models in
predicting yields on a 70/30 train-test random split, and claim strong
performance on an out-of-sample test set of unseen isoxazoles.

We applied the classical method of multiple hypotheses (4, 5) to investigate
alternative explanations for the observed machine learning model performance.
The experiments in this study explore the effect of four reaction parameters
(aryl halide, catalyst, base, and additive), with all combinations exhaustively
generated through 4608 different reactions. This complete combinatorial layout
provides an underlying structure to the data irrespective of any chemical
knowledge. Correspondingly, we posited the alternative hypothesis that the
machine learning algorithms exploit patterns within the underlying experimental
design, instead of learning solely from meaningful chemical features. A model
that learns patterns particular to an experimental layout, rather than from
meaningful input features, cannot be relied upon to generalize to new examples
(i.e., reaction components).

Fig. 1. Comparison of input representation control experiments. (A) Schematic of
Ahneman et al.’s featurization versus random features and one-hot encoded
categorical features. (B) Machine learning models trained using random feature
barcodes provide near-identical performance on the exact 70/30 train-test split
reported. Note: Bayes’ generalized linear model was not assessed; five-neuron
versus 100-neuron networks are shown instead. (C) Comparison of coefficient of
determination (R²) and RMSE values between input representations.

Fig. 2. Out-of-sample performance on individual plate predictions and analysis of
feature importance bias. (A) Platewise predictions on additives using Ahneman et
al.’s chemical featurization. (B) Analogous platewise predictions on additives
using one-hot encoding. (C) Box plot of average feature importances extracted
from 100 trials of shuffled data, showing median values with first and third
quartiles. Plot whiskers represent minimum and maximum importance values across
random trials. (D) Top 10 feature importances from a single representative trial.

Following the logic of exclusions (4), we performed two experiments intended to
disprove the alternative hypothesis, wherein we ablated all chemical information
from the Ahneman et al. dataset and evaluated the same machine learning methods.
All computational analyses were performed in the Python package Scikit-learn (6).
In the first experiment, we replaced the extracted chemical features of each
molecule with random numbers, which effectively creates a unique, random
“barcode” mimicking the reaction fingerprints used in the paper (Fig. 1A). For
example, the 27 chemical descriptors (¹H and ¹³C nuclear magnetic resonance
shifts, dipole moment, etc.) that had been used to represent 4-bromotoluene were
replaced with 27 random numbers drawn from a standard normal distribution.
Applying these random barcodes to the exact train-test split of the dataset used
by Ahneman et al. resulted in “straw” models (7) that achieved predictive
performance nearly identical to those trained on actual chemical features
(Fig. 1B). In a related second experiment, we encoded each reaction component as
a “one-hot” vector (i.e., a “dummy” encoding) that denotes only the presence or
absence of each component (e.g., additive-1, additive-2, etc.; see Fig. 1A) (8).
One-hot encoding likewise provided near-identical performance for each model
(Fig. 1C). Critically, both of these approaches encode no notion of chemistry,
and by definition cannot generalize to new chemical entities. We note that these
results do not indicate that chemical features are unimportant, but instead
suggest that the retrospective study performed in the paper is incapable of
distinguishing between meaningful featurization and random featurization.
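The two control representations described above can be sketched in a few lines. This is an illustration only, not the code used in the Comment: the reagent names other than 4-bromotoluene, the helper names (`random_barcodes`, `one_hot_codes`, `featurize`), and the choice of 27 features for additives are all assumptions made for the example.

```python
import random

# Hypothetical reagent names; only 4-bromotoluene and its 27 descriptors
# are taken from the text.
ARYL_HALIDES = ["4-bromotoluene", "aryl-halide-2", "aryl-halide-3"]
ADDITIVES = ["additive-1", "additive-2", "additive-3", "additive-4"]


def random_barcodes(names, n_features, rng):
    """Map each reagent to n_features draws from a standard normal."""
    return {name: [rng.gauss(0.0, 1.0) for _ in range(n_features)]
            for name in names}


def one_hot_codes(names):
    """Map each reagent to an indicator vector that marks only its identity."""
    return {name: [1.0 if i == j else 0.0 for j in range(len(names))]
            for i, name in enumerate(names)}


def featurize(reaction, tables):
    """Concatenate the per-component vectors for one reaction."""
    features = []
    for component, table in zip(reaction, tables):
        features.extend(table[component])
    return features


rng = random.Random(0)
barcode_tables = [
    random_barcodes(ARYL_HALIDES, 27, rng),
    random_barcodes(ADDITIVES, 27, rng),  # width for additives is an assumption
]
one_hot_tables = [one_hot_codes(ARYL_HALIDES), one_hot_codes(ADDITIVES)]

reaction = ("4-bromotoluene", "additive-2")
x_barcode = featurize(reaction, barcode_tables)
x_one_hot = featurize(reaction, one_hot_tables)
```

Either vector identifies which reagents are present but carries no chemical information, which is why a model trained on them cannot say anything about a reagent it has never seen.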

Prospective out-of-sample test sets provide a 
more rigorous measure of model generalization. 
Ahneman et al. reported that an out-of-sample 
set of eight isoxazole additives, representing 
one of the three 1536-well plates used in their 
study, shows good generalization [root mean 
square error (RMSE) = 11.3%]. To account for 
variance in sampling and to establish a compre- 
hensive picture, we completed analyses that, in 
turn, independently hold out each of the remain- 
ing two experimental plates (Fig. 2A). We found 
that performance dropped starkly (RMSE = 22.0 
and 17.3%, respectively), indicating that the models’ 
generalizability was more limited than expected. 
Similarly, we repeated the analysis using one-hot 
encoded representations in place of chemical 
features, as in Fig. 1A. Machine learning algo-
rithms trained on one-hot representations learn
only from the presence or absence of additives in 
the training set, and by definition cannot gen- 
eralize to unseen additives. We thus anticipated 
highly diminished straw model performance in 
this sanity check. Surprisingly, platewise perform- 
ance (Fig. 2B) using one-hot encoding tracked 
closely with that obtained using chemical features 
(Fig. 2A), as did random barcode results (not
shown). These results failed to strongly dis-
tinguish meaningful from random featuriza-
tion, despite the prospective context.
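A platewise hold-out of the kind described above can be sketched as follows. The plate layout and additive names are hypothetical, and no model is fitted; the point is only to show why a one-hot representation has nothing to learn from for held-out additives: the columns that identify them are zero for every training reaction.

```python
import math

# Hypothetical plate layout: which additives sit on which plate.
PLATES = {
    1: ["additive-1", "additive-2"],
    2: ["additive-3", "additive-4"],
    3: ["additive-5", "additive-6"],
}
ALL_ADDITIVES = [a for plate in sorted(PLATES) for a in PLATES[plate]]


def one_hot(additive):
    """Indicator vector over all additives in the design."""
    return [1 if additive == name else 0 for name in ALL_ADDITIVES]


def leave_one_plate_out(plates):
    """Yield (held_out_plate, training_additives, test_additives)."""
    for held_out in sorted(plates):
        train = [a for p in sorted(plates) if p != held_out for a in plates[p]]
        yield held_out, train, list(plates[held_out])


def rmse(y_true, y_pred):
    """Root mean square error between two equal-length sequences."""
    return math.sqrt(
        sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    )


def never_active_columns(additives):
    """Indices of one-hot columns that are 0 for every listed additive."""
    rows = [one_hot(a) for a in additives]
    return {j for j in range(len(ALL_ADDITIVES))
            if all(row[j] == 0 for row in rows)}


for plate, train, test in leave_one_plate_out(PLATES):
    test_columns = {ALL_ADDITIVES.index(a) for a in test}
    # True when every column identifying a held-out additive is constant
    # (all zero) across the training additives.
    print(plate, sorted(test_columns), test_columns <= never_active_columns(train))
```

In a full analysis, `rmse` would be applied to the predictions for each held-out plate in turn, giving one error value per plate rather than a single figure.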

Looking beyond prediction performance, 
Ahneman et al. thoughtfully analyzed the rela- 
tive importance of the chemical features used 
by their top-performing random forest model. 
They found that isoxazole additive-based de- 
scriptors most significantly affect yield pre- 
diction mean square error under permutation 
analysis (9). By contrast, the random-feature 
and one-hot encoding straw experiments we 
performed above suggest that isoxazole ad- 
ditives play only a limited role in predicting 
reaction outcome, and we looked to understand 
this discrepancy. Traditional random forest im- 
plementations can exhibit feature-importance 
bias when inputs vary in scale or when catego-
ries vary in number of classes (10, 11). We sus-
pected that additive feature importance may be 
enriched as the result of a similar effect. Con- 
sequently, we shuffled all training data to decor- 
relate the predictive variables (features) from 
the output (yields) and trained a random forest 
regressor on the shuffled data. In 100 trials of 
this randomized-data test, additive features were 
nonetheless consistently identified as most im- 
portant, and consistently occupied 9 of the top 10 
by rank (Fig. 2, C and D). These results indicate 
that apparently high additive feature importances 
cannot be distinguished from hidden structure 
within the dataset itself. 
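The shuffled-data control described above can be sketched with scikit-learn. This is a minimal illustration under stated assumptions, not a reproduction of the analysis: the design sizes, the single made-up descriptor per component, the placeholder yields, and the use of impurity-based `feature_importances_` are all choices made for the example, and only 3 trials are run instead of 100 to keep it quick.

```python
import itertools

import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Hypothetical full-factorial design with one made-up descriptor per component.
levels = {"aryl_halide": 15, "ligand": 4, "base": 3, "additive": 23}
names = list(levels)
descriptor = {name: rng.normal(size=n) for name, n in levels.items()}

rows = list(itertools.product(*(range(levels[name]) for name in names)))
X = np.array([[descriptor[name][i] for name, i in zip(names, row)]
              for row in rows])

# Placeholder yields; after shuffling they are unrelated to X by construction.
y = rng.uniform(0.0, 100.0, size=len(rows))


def shuffled_importances(X, y, n_trials, rng):
    """Average impurity-based importances over forests fit to permuted yields."""
    totals = np.zeros(X.shape[1])
    for trial in range(n_trials):
        y_shuffled = rng.permutation(y)
        forest = RandomForestRegressor(n_estimators=20, random_state=trial)
        forest.fit(X, y_shuffled)
        totals += forest.feature_importances_
    return totals / n_trials


importances = dict(zip(names, shuffled_importances(X, y, n_trials=3, rng=rng)))
for name, value in sorted(importances.items(), key=lambda kv: -kv[1]):
    print(f"{name:12s} {value:.3f}")
```

Because the yields are shuffled, any ranking that emerges reflects the layout of the inputs (here, how many distinct values each feature takes) rather than chemistry, which is the kind of bias the control is meant to expose.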

We believe that these results, taken together, 
illustrate the need to incorporate random-control 
procedures (7) when applying machine learning 
to new scientific domains. We find that the ex- 
perimental design is insufficient to establish that 
models built on the proposed chemical fea- 
turization can generalize to new chemical en- 
tities, or meaningfully outperform straw models 
trained on randomly assigned features. However, 
we do not conclude that chemical features are 
unimportant, nor that the ones used here are 
necessarily incorrect. Nor do we believe that 
careful design of chemical features is futile. Rather, 
further studies that more expansively explore 
each reaction dimension (additional bases, lig- 
ands, substrates, and additives) may be a means 
to demonstrate that these models can be usefully 
adopted for reaction prediction. Flexible and 
powerful machine learning models have become 
widespread and readily available. As these tools 
permeate the physical and life sciences, so too must 
accompanying methods to distinguish models 
that learn peculiarities of an experiment’s layout 
from those that extract meaningful and actionable 
patterns beyond it. With randomized controls to 
guide experimental design, Ahneman et al.’s 
novel machine learning approach to reaction 
prediction may best prove its merit. 
Response to Comment on “Predicting
reaction performance in C-N 
cross-coupling using machine learning” 


Jesús G. Estrada, Derek T. Ahneman, Robert P. Sheridan,
Spencer D. Dreher*, Abigail G. Doyle*


We demonstrate that the chemical-feature model described in our original paper is 
distinguishable from the nongeneralizable models introduced by Chuang and Keiser. 
Furthermore, the chemical-feature model significantly outperforms these models in out-of- 
sample predictions, justifying the use of chemical featurization from which machine learning 
models can extract meaningful patterns in the dataset, as originally described. 


In Ahneman et al. (1), we showed that a
random forest (RF) algorithm built using
computationally derived chemical descrip-
tors for the components of a Pd-catalyzed
C-N cross-coupling reaction (aryl halide, lig-
and, base, and potentially inhibitory isoxazole
additive) could identify predictive and meaningful 
relationships in a multidimensional chemical 
dataset comprising 4608 reactions. Chuang and 
Keiser (2) built alternative models using random 
barcode features (“straw” models), wherein 
the chemical descriptors are replaced with ran- 
dom numbers selected from a standard normal 
distribution. One-hot encoded features, wherein 
each reagent acts as a categorical descriptor and 
is marked as absent or present, were also eval- 
uated. Models built with either set of label fea- 
tures are not generalizable, meaning that they 
cannot make distinct predictions for new chem- 
ical entities not found in the training set. Using 
these alternative models, Chuang and Keiser con- 
clude that the dataset described in our paper is 
insufficient to establish that models built with 
chemical features can generalize to new chem- 
ical entities or outperform models built with 
reagent-label features. However, the authors 
disregard the chemistry underlying the dataset, 
and in so doing, they base their conclusions on 
test sets that are poor indicators of model sim- 
ilarity (Plate 1 or 3) and performance (Plate 2). 
Here, we show that our original out-of-sample 
test set (Plate 3), although representative of the 
generalizability of the chemical descriptor model, 
was suboptimal in distinguishing nongeneraliz- 
able models because it was composed of primarily 
average-yielding additives. However, using rigor- 
ous tests of generalizability, we demonstrate that 
the chemical descriptor model presented in our
original study is statistically distinct from and 
significantly outperforms models built on reagent- 
label features. 

In our original paper, we demonstrated that an
RF model delivered high predictive performance 
among a panel of machine learning (ML) algo- 
rithms in a 70/30 train-test split of the dataset. 
Chuang and Keiser show that models built with 
barcodes and one-hot encoding achieved near-
identical predictive performances. Because a 70/30
random split of the entire data results in a test
set composed of reactions with components that 
the model has seen in the training set at least 
once, an ML algorithm is capable of learning the
reactivity of each reaction component. Thus, 
ML algorithms can perform well using a variety 
of representations for the reaction components, 
whether the representations are continuous 
chemical descriptors or reagent labels. For this 
reason, retrospective tests like those in our man- 
uscript and Chuang and Keiser’s comment can 
only be used to conclude that the RF algorithm 
outperforms other ML algorithms. 
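The point that a random split of a full-factorial design leaves no component unseen can be checked directly. In this sketch the reagent names and the counts of aryl halides, ligands, and bases are hypothetical; only the figure of 23 additives is taken from the text.

```python
import itertools
import random

# Hypothetical component lists; only the 23 additives come from the text.
components = {
    "aryl_halide": [f"aryl-halide-{i}" for i in range(1, 16)],
    "ligand": [f"ligand-{i}" for i in range(1, 5)],
    "base": [f"base-{i}" for i in range(1, 4)],
    "additive": [f"additive-{i}" for i in range(1, 24)],
}

# Every combination of one reagent per component is one reaction.
reactions = list(itertools.product(*components.values()))

rng = random.Random(0)
rng.shuffle(reactions)
cut = (7 * len(reactions)) // 10  # 70/30 random split
train, test = reactions[:cut], reactions[cut:]

# Reagents that appear in at least one training reaction.
seen = set(itertools.chain.from_iterable(train))

# Reagents that occur in the test set but never in training.
unseen_in_test = {c for reaction in test for c in reaction if c not in seen}
print(len(train), len(test), unseen_in_test)
```

With every reagent present in training, a label is as informative as a descriptor, so this kind of split cannot separate the two representations.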

That an ML algorithm can be built with random
barcodes or reagent labels does not mean that 
these or the chemical descriptors are meaning- 
less, nor does it invalidate any structure-activity 
relationship present in a chemical descriptor 
model. Performing a Y-randomization test (a
well-accepted control for interrogating the null
hypothesis that there is no structure-activity
relationship in the data) on the chemical de-
scriptor model results in an average cross-
validated R² value of -0.01, demonstrating that
the model encodes meaningful information (3, 4).
Nonetheless, a model built on reagent labels as 
descriptors cannot be extrapolated to chemical 
entities not in the original set. For that, one 
needs some flavor of chemical descriptors. Thus, 
out-of-sample prediction is the appropriate test 
of generalizability and is the overall justification 
for using chemical features. 
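
As a rough illustration of the Y-randomization control described above, the following minimal sketch uses a one-descriptor least-squares model on synthetic data, not the random forest or the dataset of the study. The function names (`fit_line`, `loo_cv_r2`, `y_randomization`) and all numbers are hypothetical. Shuffling the responses breaks any descriptor-response relationship, so the cross-validated R² is expected to collapse to roughly zero or below, whereas the unshuffled data retain a high value.

```python
import random


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def loo_cv_r2(xs, ys):
    """Leave-one-out cross-validated R^2 (can be negative)."""
    preds = []
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        preds.append(slope * xs[i] + intercept)
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


def y_randomization(xs, ys, n_shuffles=20, seed=0):
    """Mean cross-validated R^2 after repeatedly shuffling the responses."""
    rng = random.Random(seed)
    scores = []
    for _ in range(n_shuffles):
        shuffled = ys[:]
        rng.shuffle(shuffled)
        scores.append(loo_cv_r2(xs, shuffled))
    return sum(scores) / len(scores)


# Synthetic data (assumption): one descriptor with a linear effect plus noise.
rng = random.Random(1)
descriptor = [float(i) for i in range(30)]
response = [2.0 * x + 1.0 + rng.gauss(0.0, 1.0) for x in descriptor]

real_r2 = loo_cv_r2(descriptor, response)
null_r2 = y_randomization(descriptor, response)
print(f"real R^2 = {real_r2:.2f}, Y-randomized R^2 = {null_r2:.2f}")
```

A Y-randomized score near zero only rules out chance correlation; it does not by itself show that a model extrapolates to new reagents, which is why out-of-sample tests are still needed.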

In our manuscript, we investigated the generalizability
of the chemical descriptor model
by using isoxazole additives on Plates 1 and 2 for 
training and additives on Plate 3 for out-of- 
sample predictions. Chuang and Keiser also in- 
vestigated two alternative splits along plate lines. 
In so doing, they found that prediction of Plate 2 
additives is poor [R² = 0.19, root mean square
error (RMSE) = 21.7%] and conclude that the 
generalizability of the chemical descriptor model 
is more limited than we reported. However, to 
use the large variation in model performance 
across the different plate test sets to assess model 
generalizability, one must assume that the train- 
ing sets of all three models cover a similar spread 
in chemical space. Figure 1A illustrates the effect 
of additives on yield across plate lines. Among 
the 23 additives examined, four additives (10, 
11, 13, and 14) serve as severe reaction poisons, 
resulting in substantially lower average yields 
than the rest. All four of these additives are lo- 
cated in Plate 2. Thus, a test set comprising Plate 
2 additives involves a training set without any of 
the reaction poisons. Such a training set would 
be expected to result in a poorly predictive model 
whose performance would also be a poor indicator 
of generalizability (5). Investigating the Plate 2 
predictions further, we replaced one of the re- 
action poisons (13) in Plate 2 with an average- 
yielding isoxazole (2) to afford a Plate 2’ test set, 
thus guaranteeing that the training set includes 
at least one example of a reaction poison. The 
model performance increased from R² = 0.19 to
R² = 0.64 (Fig. 1B) (6).

For a more systematic evaluation of model 
generalizability, we turned to activity ranking 
for out-of-sample test set design (Fig. 1C), which 
is considered a better indicator of generalization 
than random splitting, as used in our original 
study (7). Using this method of training/test set 
design, we split the data into four additive out-of-sample
test sets, resulting in models with an R² range of 0.69 ± 0.12.
Whereas the observed mean performance is slightly lower than
we reported based solely on Plate 3 (R² = 0.81), Plate 3
predictions are well within the observed range, as are Plate 1
and Plate 2' predictions (R² = 0.66 and 0.64, respectively).
By comparison, Plate 2 predictions (R² = 0.19) are significantly out of the
observed range of performance. These results con- 
firm that the chemical descriptor model has good 
generalizability along the additive dimension. 
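
The activity-ranking design can be sketched as follows, based on the description in the Fig. 1C caption: additives are ranked by average yield, the lowest- and highest-yielding additives stay in every training set, and every fourth additive of the remainder forms a test set. This is a minimal sketch under stated assumptions. The function name `activity_ranked_splits`, the additive labels, and the yields are hypothetical and are not taken from the study.

```python
def activity_ranked_splits(avg_yield, n_sets=4):
    """Return (train, test) additive lists built by activity ranking.

    avg_yield maps an additive label to its average yield.
    """
    ranked = sorted(avg_yield, key=avg_yield.get)
    anchors = [ranked[0], ranked[-1]]   # extremes stay in every training set
    middle = ranked[1:-1]
    splits = []
    for offset in range(n_sets):
        test = middle[offset::n_sets]   # every n_sets-th additive by rank
        train = anchors + [a for a in middle if a not in test]
        splits.append((train, test))
    return splits


# Hypothetical average yields (%) for 22 additives labelled A1..A22.
yields = {f"A{i}": 5.0 + 3.5 * i for i in range(1, 23)}
splits = activity_ranked_splits(yields)
for train, test in splits:
    print(len(train), "training additives; test set:", test)
```

Because each test set samples the full yield ranking, every training set spans a similar range of activity, which is the property the plate-based splits lacked.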

We next turned to the question of whether the 
chemical descriptor model was distinguishable 
from the nongeneralizable models. Chuang and 
Keiser show that the three models have similar 
aggregate test set performances for Plate 3 pre- 
dictions. However, evaluating the predictions of 
the models for individual additives on Plate 3 
reveals that the models make distinct predic- 
tions. For example, a plot of predicted yields 
for the chemical descriptor versus one-hot en- 
coded models, as shown in Fig. 2A, illustrates 
that the chemical descriptor model makes dif- 
ferent predictions for different out-of-sample ad- 
ditives, something that the one-hot and random 
barcode models cannot do. Furthermore, for the
poorer-performing additives in the test set (16 
and 18), the one-hot encoded model overpredicts 
yields (e.g., R² = 0.31 for 16), consistent with the
model’s inability to predict these additives as
mild reaction poisons (Fig. 2B). By contrast, the
chemical descriptor model predicts that these
additives will lead to diminished yields relative
to the average (e.g., R² = 0.90 for 16), thereby
capturing chemically meaningful information in 
the additive dimension. 

Given these findings, why are the aggregate 
test set performances between the various mod- 
els for Plate 3 predictions similar? Using prin- 
cipal components analysis, we found that six of 
the eight additives in the test set have highly 
correlated outputs and are average-yielding. As 
such, the nongeneralizable models perform com- 
petitively with the chemical descriptor model 
because these models make a single prediction 
that is an average of all of the additives in the 
training set. A chemical descriptor model is ex- 
pected to statistically outperform a one-hot model 
in test sets with a greater number of extreme 
outcomes. Indeed, the Plate 2’ test set described 
in this response exhibited a large difference between
the two models, with R² values of 0.64
versus 0.19 (Fig. 1B). Thus, we proceeded to use 
Plate 2 as a template to evaluate Chuang and 
Keiser’s null hypothesis that the two models 
are indistinguishable (8). 
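
The reason a reagent-label model returns one prediction for every out-of-sample additive can be illustrated with a toy example. The sketch below is a stand-in, not a random forest: `LabelLookupModel` is a hypothetical class that stores the mean yield per training label and falls back to the overall training mean for labels it has never seen. The additive labels and yields are invented for illustration.

```python
def one_hot(label, vocabulary):
    """Encode a label against the training vocabulary; unseen labels become all zeros."""
    return [1 if label == known else 0 for known in vocabulary]


class LabelLookupModel:
    """Toy reagent-label model: per-additive mean yield, global mean for unseen labels."""

    def fit(self, labels, yields):
        groups = {}
        for label, y in zip(labels, yields):
            groups.setdefault(label, []).append(y)
        self.means = {k: sum(v) / len(v) for k, v in groups.items()}
        self.global_mean = sum(yields) / len(yields)
        return self

    def predict(self, label):
        return self.means.get(label, self.global_mean)


# Hypothetical training data: three additives, two reactions each.
train_labels = ["add_1", "add_1", "add_2", "add_2", "add_3", "add_3"]
train_yields = [70.0, 60.0, 50.0, 40.0, 10.0, 10.0]
vocabulary = sorted(set(train_labels))
model = LabelLookupModel().fit(train_labels, train_yields)

# Two out-of-sample additives are indistinguishable to a label-based model.
print(one_hot("add_16", vocabulary), one_hot("add_23", vocabulary))
print(model.predict("add_16"), model.predict("add_23"))
```

Both unseen additives receive the same all-zero encoding and therefore the same prediction, so such a model can only score well when the test additives happen to be close to the training average.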

Fig. 1. Identifying reaction poisons and their effect on prediction. (A) Average yields (dots) and standard deviations (error bars) of the 180 reactions involving all combinations of aryl halides (15), catalysts (4), and bases (3) of each of the 23 additives except additive 7 (control). Box highlights four additives (10, 11, 13, and 14) with substantially lower reaction yields than the rest, indicating their characteristic as reaction poisons. (B) The Plate 2 out-of-sample test set was altered by replacing additive 13, a reaction poison, with average-yielding isoxazole 2, thereby ensuring that the training set contained one additive that served as a reaction poison. RF models were built using chemical descriptors and one-hot

A total of 14 test sets were created by replacing
one (four test sets), two (six test sets), or three
reaction poisons (four test sets). Not surprisingly,
we found the greatest difference in performance
between the chemical descriptor and one-hot
model for the test sets incorporating three reaction
poisons (Fig. 2C). Across the 14 test sets,
we observed R² values of 0.63 ± 0.12 for the
chemical descriptor model and 0.36 ± 0.20 for
the one-hot model; these values indicate that
the one-hot model affords an overall worse and
more variable predictive performance and that
the models are distinguishable at a statistically
significant level (P < 0.01). To evaluate the predictive
value of the additive features for these
14 test sets, we also compared the chemical-feature
model to a RF model built using no chemical
features for the additives (R² = 0.54 ± 0.15)
and a model built using one-hot features for the
additives but chemical features for the aryl halides,
bases, and ligands (R² = 0.36 ± 0.19) (Fig. 2D).
These experiments clearly show the benefit of 
using chemical descriptors for out-of-sample 
prediction and demonstrate that the chemical 
descriptors used in our original study are not 
solely acting as reagent identifiers (9). 
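
The count of 14 test sets is consistent with taking every way of replacing one, two, or three of the four reaction poisons, since C(4,1) + C(4,2) + C(4,3) = 4 + 6 + 4. The short sketch below enumerates those choices. It assumes that the test sets correspond to all such combinations. Which average-yielding additives were swapped in is not stated here, so that step is not modeled.

```python
from itertools import combinations

poisons = [10, 11, 13, 14]  # additives identified as reaction poisons on Plate 2

# Every way of swapping out one, two, or three of the four poisons.
replaced_sets = [
    combo
    for k in (1, 2, 3)
    for combo in combinations(poisons, k)
]

counts = {k: sum(1 for c in replaced_sets if len(c) == k) for k in (1, 2, 3)}
print(len(replaced_sets), counts)
```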

[Fig. 1B panel: Plate 2' (replace 13 with 2), chemical descriptors; R² = 0.64, RMSE = 15.2]

Having confirmed that the chemical descriptor
model is generalizable and that the features
used have chemical meaning, variable-importance
analysis provides a useful tool to obtain chemical
insights and guide mechanistic inquiry, as highlighted
in our original study. Nonetheless, Chuang
and Keiser show that RF algorithms can exhibit
descriptor bias, which can skew the analysis of
important features. To evaluate the impact that
this might have had on our analysis, we investigated
an alternative to the randomforest function:
the cforest function, which has been
shown to avoid descriptor selection bias (10).
Use of the cforest algorithm resulted in cross-validation
test set statistics (R² = 0.84) similar to
those of the randomforest function. As in our
original study, aryl halide and additive *C3 nuclear
magnetic resonance (NMR) shifts appeared
in the top five chemical descriptors, reinforcing 
the inference that led to the experiments de- 
signed to test whether competitive oxidative ad- 
dition of the isoxazole could be a source of 
deleterious side reactivity. Evaluation of a deci- 
sion tree (DT) model further supplemented our 
analysis of important descriptors. The aryl halide 
and additive *C3 NMR shifts appear in the first 
two discriminating nodes, consistent with the 
variable importances from the RF model dis- 
cussed above. Analysis of how chemical descrip- 
tors bin reaction components in the DT model 
(i.e., along aryl halide electronic properties) sug- 
gests that the chemical descriptors supply the 
RF model with the ability to recognize chemical
phenomena along the aryl halide and additive
dimensions (11).

In summary, Chuang and Keiser’s reagent-label
models are valuable representations of a
closed dataset and useful comparator models in
tests of generalizability. Incorporation of these
models into our workflow has revealed that the
out-of-sample validation test in our study was
not an optimal test for generalization; however,
it delivered a performance representative of the
model’s generalizability. On the other hand, our
evaluation of Chuang and Keiser’s conclusions
highlights that an understanding of the chemical
reactivity underlying a dataset is necessary in
order to use the dataset and reagent-label models
to assess the scope and limitations of chemical
featurization for reaction prediction. Ultimately,
our original conclusion that the RF model is based
on meaningful and generalizable chemical features
has been strengthened by this additional
analysis.

Machine learning offers numerous opportunities
to augment how synthetic chemists generate
and use data for discovery, optimization, and
adoption of synthetic methods (12). Its advancement
and proliferation will require continued
progress in the collection, analysis, and reporting
of data, in the description of chemical space, and
in predictive modeling. Constructive discussions
among chemists, computer and data scientists,
and chemical engineers will be important in
making this happen.

encoded labels for comparison. (C) To guarantee that the training set and test set would cover chemical space similar to that covered by the entire dataset, we designed test sets according to activity ranking. Isoxazole additives were ranked according to increasing average yields. The lowest- and highest-yielding additives were kept in all training sets to maximize chemical space. The middle 20 isoxazole additives were used to form four out-of-sample test sets by taking every fourth additive as shown; the remaining additives were used for training. Coefficient of determination (R²) and RMSE were used to analyze model performance.

Fig. 2. Distinguishing chemical featurization from one-hot encoding. (A) Comparison of chemical descriptor model yield predictions versus one-hot model yield predictions for additives 16 (triangles, light blue) and 23 (circles, dark blue). A nongeneralizable model cannot make distinct predictions for out-of-sample additives. Shown are two distinct predictions made by the chemical descriptor model. (B) Calibration plots of observed versus predicted yields for additive 16, a mild reaction poison. The chemical descriptor model (left) captures the effect of additive 16 causing lower reaction yields, whereas the one-hot model (right) overpredicts. (C) Analysis of various prospective predictions according to R² values. Plates 1 and 3, which contain no significant reaction poisons in the test set, show minimal differences between the chemical descriptor and one-hot models. Plate 2 contains all four reaction poisons, resulting in a poorly designed training set. Fourteen test sets were designed to incorporate various numbers of reaction poisons and were used to assess the robustness of the chemical descriptor RF model relative to a one-hot encoded RF model. Shown are R² averages (dots, triangles) with standard deviations. (D) Comparison of the 14 additive out-of-sample test performances of RF models in which the additives are described by chemical descriptors (Chemical), no features (None), one-hot features (Onehot), and straw features (Straw). Relative to absence of additive features, the use of chemical descriptors boosts model performance, whereas the use of reagent label features diminishes model performance. Shown is a box-and-whisker plot.


11. For the smaller base and catalyst dimensions, it is difficult to
distinguish whether the RF model uses chemical features to 
describe meaningful chemical patterns (i.e., Base N1 
electrostatic charge) or simply to label reagents. 

12. For another study using a RF algorithm to predict reaction 
performance wherein we constructed one-hot models as 
comparator models and found that they delivered significantly 
inferior predictive ability for out-of-sample test sets relative to 
a chemical descriptor model; see (13). 

13. M. K. Nielsen, D. T. Ahneman, O. Riera, A. G. Doyle, J. Am.
Chem. Soc. 140, 5004-5008 (2018).

In every stage race, competitive cyclists perform an experiment of sorts to
discover who can most efficiently turn dietary energy sources into maxi- 
mal power output on the bike. More broadly, we’re all interested in diet 
because abundant evidence shows that diet has major effects on human 
health and resistance to rampant diseases associated with aging, such 
as obesity, cardiovascular disease, and diabetes. Advice on what consti- 
tutes a healthy diet is more prevalent and more inconsistent than ever. 
For this special issue, we checked in with the experts. On the question 
of how much fat we should eat, recommendations have swung from one 
extreme to the other. We consulted with a group of scientists representing 
different sides in the debate over the proportion of fat in a healthy diet and, 
importantly, which particular fats are most healthful. We share our 
meals with trillions of bacteria in the digestive system, so a prom- 
ising and emerging area of investigation explores how diet 
influences our give-and-take interaction with gut symbi- 
onts. It’s not just what you eat but when you eat it, and 
periods of fasting have some remarkable benefits. 
A pervasive theme is that much of the disagree- 
ment and confusion reflects a lack of solid sci- 
entific studies on humans. Clearly, many more 
well-designed studies are needed to deter- 
mine the best diet for people, and how that 
varies with activity, at different life stages 
and for different individuals. And indi- 
vidual needs can be extreme—a cyclist 
at the top of the sport recorded mas- 
sive carbohydrate loading before 
an intense stage, eating the equiva- 
lent of 85 slices of bread! 


Maximum performance or maximized health
benefits require a nutritious mix of foods in the diet.

REVIEW


Dietary fat: From foe to friend? 


David S. Ludwig, Walter C. Willett, Jeff S. Volek, Marian L. Neuhouser


For decades, dietary advice was based on the premise that high intakes of fat cause 
obesity, diabetes, heart disease, and possibly cancer. Recently, evidence for the adverse 
metabolic effects of processed carbohydrate has led to a resurgence in interest in 
lower-carbohydrate and ketogenic diets with high fat content. However, some argue
that the relative quantity of dietary fat and carbohydrate has little relevance to health and
that focus should instead be placed on which particular fat or carbohydrate sources are 
consumed. This review, by nutrition scientists with widely varying perspectives, 
summarizes existing evidence to identify areas of broad consensus amid ongoing 
controversy regarding macronutrients and chronic disease. 


A report by the U.S. Senate Select Committee
on Nutrition and Human Needs in 1977
called on Americans to reduce consump- 
tion of total and saturated fat, increase car- 
bohydrate intake, and lower calorie intake, 
among other dietary goals (1). This report, by
elected members of Congress with little scientific 
training, was written against a backdrop of grow- 
ing public concern about diet-related chronic dis- 
ease, precipitated in part by attention surrounding 
President Eisenhower’s heart attack in 1955. 

Even then, the recommendations were hotly 
debated. The American Medical Association stated 
that “The evidence for assuming benefits to be 
derived from the adoption of such universal die- 
tary goals as set forth in the report is not con- 
clusive ... [with] potential for harmful effects.” 
Indeed, the lack of scientific consensus was re- 
flected in the voluminous, 869-page “Supplemental 
Views” published contemporaneously by the com- 
mittee. Nonetheless, reduction in fat consumption 
soon became a central principle of dietary guide- 
lines from the U.S. government and virtually all 
nutrition- and health-related professional organi- 
zations. [Note that modern approaches to the 
study of diet-related chronic diseases were at 
that time in their infancy; previously, nutritional 
science was focused on individual nutrients for 
the prevention of deficiency diseases (2).] 

The Surgeon General’s Report on Nutrition 
and Health in 1988 identified reduction of fat 
consumption as the “primary dietary priority,” 
with sugar consumption only a secondary con- 
cern for children at risk for dental caries (3). The 
1992 Food Guide Pyramid of the U.S. Department 
of Agriculture advised eating 6 to 11 daily servings 
of starchy foods such as bread, cereal, rice, and
pasta while limiting all fats and oils. To facilitate
this goal, the U.S. Healthy People 2000
report of the Department of Health and Human 
Services called on the food industry to market 
thousands of new “processed food products that 
are reduced in fat and saturated fat” (4). This 
intensive focus on reducing dietary fat was driv- 
en by a prevailing belief that carbohydrates—all 
carbohydrates, including highly processed grains 
and sugar—were innocuous and possibly protec- 
tive against weight gain, cancer, and cardiovas- 
cular disease through multiple mechanisms (5). 
As a result, the proportion of fat in the U.S. 
diet decreased from about 42% in the 1970s to 
about 34% of total calories today (somewhat 
greater than the stated goal of <30%) and the 
proportion of dietary carbohydrates increased 
substantially (6). During this time, rates of obesity 
and diabetes increased greatly, contributing to 
the first nationwide decrease in life expectancy 
since the flu pandemic 100 years ago (7). These 
trends could be causally connected or unrelated. 
If causal, how could some traditional socie- 
ties, such as that of Okinawa, enjoy relative free- 
dom from chronic disease and long lifespan 
when they consume a low-fat diet (8)? In Mexico, 
Brazil, and China, rates of obesity and diet-related 
chronic diseases have also increased without sim- 
ilar government dietary guidance to individuals 
and food manufacturers. Moreover, many other 
aspects of the American diet changed in the past 
40 years, including increased portion sizes, greater 
consumption of foods away from home, and more 
extreme food processing. At the same time, labor- 
saving technology and the digital age have led to 
declines in occupational and recreational physical 
activity, and budget shortfalls in schools have led 
to curtailments in physical education classes, recess 
time, and after-school recreation opportunities. 
Despite a lack of clear evidence specifically 
relating fat consumption (as a proportion of 
total energy intake) to the epidemics of diet- 
related disease—and a lack of high-quality, 
long-term trials focused on macronutrients in 
general—the pendulum has recently swung in the 
opposite direction, with rising consumer popu- 
larity of low-carbohydrate, high-fat diets. Among 
the current top-10 best-selling weight loss books 
on Amazon.com, four promote a ketogenic diet
with energy intake derived mainly from fat. In
support of higher fat intake, several meta-analyses 
found slightly greater weight loss on high-fat rather 
than low-fat diets (9, 10), and preliminary data 
suggest the potential for excellent control of dia- 
betes through carbohydrate restriction (11, 12). 
But versions of low-carbohydrate, high-fat diets 
have been around since at least the 1800s,
with no clear evidence of superiority for long-term 
obesity treatment at present. And regardless of 
body weight, high intakes of fat—especially from 
red meat and dairy products—might increase risk 
for heart disease or cancer. 

Perhaps both high-carbohydrate, low-fat and 
low-carbohydrate, high-fat diets have benefit for 
different populations or for different clinical out- 
comes, and the critical issue is to identify the op- 
timal macronutrient ratio for an individual. Or 
perhaps the focus on macronutrient quantity has 
been a distraction, and qualitative aspects (the 
particular sources of fat or carbohydrate) and 
overall eating patterns are more important. 

To explore these issues, we have joined to- 
gether as scientists with a diversity of expertise, 
perspectives, and prior research focus. Our aim is 
not to assemble a premature consensus among 
the like-minded, but rather to identify areas of 
general agreement and delineate a research agenda 
to address long-standing controversies. 


The case for a low-fat, high-carbohydrate diet

Physiologic mechanisms


Among many societies worldwide, carbohydrate 
is the primary source of energy, providing 50% or 
more of daily energy, with lesser amounts from 
both fat and more expensive and scarce protein. 
Population-level or ecological studies comparing 
global chronic disease rates show that less devel- 
oped countries have lower rates of cardiovascular 
disease, obesity, and cancer than more Wester- 
nized countries. When individuals move from 
countries with low chronic disease rates to Wester- 
nized countries, their incidence of chronic diseases 
approaches that of their new country within one 
to two generations. This rapid shift in chronic 
disease rates spurred thinking that environmental 
exposures, such as adoption of a higher-fat Western 
diet, may be causally related to disease risk pat- 
terns. [A low-fat diet typically contains <30% en- 
ergy as fat, and a very-low-fat diet <20%, versus 
32 to 36% in the United States (6).] 

Humans ingest complex food mixtures that 
include macronutrients (fat, carbohydrate, and 
protein) and alcohol as energy sources. Macro- 
nutrients have highly regulated yet integrated 
metabolic interactions. One consideration for 
judging optimal macronutrient intake is the 
relative efficiency of substrate oxidation and inter- 
conversion. Humans preferentially oxidize carbo- 
hydrate over fat, a process that helps to maintain 
blood glucose within homeostatically controlled 
ranges. Further, carbohydrate consumption acutely 
increases carbohydrate oxidation, with only a quan- 
titatively small increase in de novo lipogenesis 
under typical conditions (13-16). Humans have 
limited storage capacity for carbohydrate but also
have extensive adipose stores, thus favoring fat
deposition with excess fat intake (17, 18). Fat is
also highly palatable and may have a weak effect
on satiation, potentially leading to passive overconsumption
(18). This excess intake, if not coupled
with increased energy expenditure, results in weight 
gain. This effect may be enhanced because, by 
weight, fat provides more than twice as much 
energy (9 kcal/g) as carbohydrate or protein 
(4 kcal/g). Conversely, diets rich in whole grains, 
which are low in fat and have a relatively low 
glycemic load, promote satiety and reduce over- 
consumption, possibly by increasing concentra- 
tions of glucagon-like peptide-1 after eating (19). 
Of 29 diets with varying macronutrient com- 
position tested in mice, only high-fat diets, but not 
high-carbohydrate diets, led to overconsumption 
and weight gain (20). Of particular interest, the 
high-fat diets increased expression of three sero- 
tonin receptors and both dopamine and opioid 
signaling pathways, components of the reward 
system in the hypothalamus. 
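
The difference in energy density noted above (9 kcal/g for fat versus 4 kcal/g for carbohydrate or protein) means that a food's share of dietary mass understates its share of energy when that food is fat. The minimal sketch below converts gram intakes into percent of energy, the unit used for the diet definitions in this review. The function name `energy_shares` and the intake values are hypothetical, and alcohol is left out for simplicity.

```python
KCAL_PER_GRAM = {"fat": 9.0, "carbohydrate": 4.0, "protein": 4.0}


def energy_shares(grams):
    """Return each macronutrient's fraction of total energy intake."""
    kcal = {name: grams[name] * KCAL_PER_GRAM[name] for name in KCAL_PER_GRAM}
    total = sum(kcal.values())
    return {name: value / total for name, value in kcal.items()}


# Hypothetical daily intake in grams.
intake = {"fat": 70.0, "carbohydrate": 250.0, "protein": 80.0}
shares = energy_shares(intake)
for name, share in shares.items():
    print(f"{name}: {share:.1%} of energy")
```

In this invented example, fat is 17.5% of macronutrient mass (70 of 400 g) but about 32% of energy (630 of 1950 kcal).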

Fat and specific fatty acids also have adverse
metabolic effects independent of calorie content.
High-fat diets up-regulate inflammatory mediators
including tumor necrosis factor-α (TNF-α),
interleukins (IL-1β, IL-6) (21), complement (22),
and Toll-like receptors (23) in human and animal
studies. In contrast, lower-fat diets reduce amounts
of these and other inflammatory cytokines,
as well as activity of the transcription
factor NF-κB (24). Palmitic and
stearic acids (abundant in animal foods)
influence the structure and function of
mitochondrial membranes, such that an
increase in these saturated fatty acids
leads to impaired membrane function
(25). High-fat diets may also promote
unfavorable epigenetic profiles. For example,
excess saturated fat changes
DNA methylation patterns in adipose
tissue (26) and skeletal muscle, and
alters histone acetylation (27, 28). When
acetyl-coenzyme A concentrations are
high, such as under conditions of low
glucose, histone acetylation increases
according to in vitro human and animal
studies (28).

High-fat diets also stimulate hepatic
bile acid synthesis, which, after conversion
into secondary bile acids in the
colon, may promote tumorigenesis (29-31).
Among Africans consuming a diet high
in minimally processed carbohydrates,
gut microbial communities were dominated
by butyrate-producing bacteria,
whereas genetically similar African Americans
consuming a high-fat diet had a
less healthful gut microbiome with high
secondary bile acid production (31).
Fat-stimulated production of bile acids was
also unfavorably associated with inflammation
and proliferation in colonic
biopsy samples (29-31). Conversely,
high-carbohydrate diets containing whole
grains and other high-fiber foods provide
the preferred fuel for colonic bacteria,
with less secondary bile acid production and
greater production of butyrate and other
short-chain fatty acids that lower inflammation, decrease
cellular proliferation, and enhance expression
of genes with antineoplastic properties. Low-fat
diets may also decrease serum estradiol and
increase sex hormone-binding globulin (32, 33)
and may reduce other breast cancer risk factors
such as mammographic density (34), although
the persistence of these effects remains unclear.

Taken together, these multiple physiologic mechanisms
suggest that higher dietary fat may be
harmful for health. However, it is critically important
to consider carbohydrate quality when
fat intake is lowered. Refined grains provide negligible
nutrition and their high glycemic load causes
unhealthful spikes in postprandial glucose and
insulin, promoting hunger, inflammation, insulin
resistance, and dyslipidemia. However, with a
lower-fat diet containing high-fiber, low-glycemic
carbohydrates such as minimally processed grains,
legumes, and nonstarchy fruits and vegetables,
these measures improve. Whole plant foods are
also rich sources of micronutrients, antioxidants,
and phytochemicals with beneficial health effects.


Obesity and diabetes


Low-fat diets may favorably influence body weight
and adiposity. In the Women’s Health Initiative
Dietary Modification Trial (WHI-DM), the low-fat
intervention (20% energy as fat, as part of
a healthy eating pattern) was associated with
significant, small reductions in body weight,
total fat mass, and percent body fat as measured
by dual x-ray absorptiometry (35). Another randomized
controlled trial (RCT) in postmenopausal
women tested a lower-fat, higher-carbohydrate
diet (20% and 65% energy, respectively), a
lower-carbohydrate, higher-fat diet (45% and 35% energy,
respectively), and a walnut-rich higher-fat,
lower-carbohydrate diet (18%, 35%, and 45%
energy, respectively) for weight loss. All three
diets led to weight loss at 12 months, with slightly
higher weight loss in the lower-fat diet group
(33). A meta-analysis of dietary intervention trials
showed that low-fat diets were effective for weight
loss under ad libitum conditions (36); however,
this was published prior to recent
carbohydrate-restricted diet studies.

Although obesity has a dominant role in the
development of diabetes, clinical trial evidence
suggests benefit for low-fat eating patterns in risk
reduction and disease management. The Diabetes
Prevention Program (DPP) was an RCT of 3234
adults at risk for diabetes (37). DPP’s primary goal
was to compare the effect of at least 7% reduction
in body weight achieved by following a
low-calorie, low-fat diet and increasing physical
activity, with that of the drug metformin
or a placebo. Rates of diabetes incidence
were reduced by 58% in the lifestyle intervention
group and by 31% in those taking
metformin, although the effects of dietary
composition cannot be fully disentangled
from weight loss and other factors. Numerous
other trials and observational
studies support the use of high-fiber whole
grains and fiber supplements for diabetes
prevention and control. A recent meta-analysis
found that fiber, typically consumed
in greater amounts in low-fat,
high-carbohydrate diets, improved measures
of glycemia and weight (38).


Box 1. Current controversies.


1. Do diets with various carbohydrate-to-fat proportions
affect body composition (ratio of fat to lean tissue)
independently of energy intake? Do they affect energy
expenditure independently of body weight?


2. Do ketogenic diets provide metabolic benefits beyond
those of moderate carbohydrate restriction? Can they
help with prevention or treatment of cardiometabolic disease?


3. What are the optimal amounts of specific fatty acids
(saturated, monounsaturated, polyunsaturated) in the
context of a very-low-carbohydrate diet?


4. What is the relative importance for cardiovascular disease
of the amounts of LDL cholesterol, HDL cholesterol,
and triglycerides in the blood, or of lipoprotein particle size,
for persons on diets with distinct fat-to-carbohydrate ratios?
Are other biomarkers of equivalent or greater importance?


5. What are the effects of dietary fat amount and quality
across the lifespan on risk of neurodegenerative, pulmonary,
and other diseases that have not been well studied?


6. What are the long-term efficacies of diets with different
carbohydrate-to-fat proportions in chronic disease prevention
and treatment under optimal intervention conditions
(designed to maximize dietary compliance)?


7. What behavioral and environmental interventions can
maximize long-term dietary compliance?


8. What individual genetic and phenotypic factors predict
long-term beneficial outcomes on diets with various fat-to-carbohydrate
compositions? Can this knowledge inform personalized
nutrition, with translation to prevention and treatment?


9. How does variation in the carbohydrate-to-fat ratio and in
sources of dietary fat affect the affordability and
environmental sustainability of diets?


Cardiovascular disease 


The effects of dietary macronutrient com- 
position on cardiovascular disease (CVD) 
risk have been a subject of debate for 
more than 40 years. Ecological studies 
and controlled feeding trials supported 
associations of higher-fat diets with CVD 
or its biomarkers of risk. However, defin- 
itive trials have not been conducted that 
explicitly test this “diet-heart hypothesis.” 
WHI-DM was not designed to test CVD 
endpoints; even so, participants in the 
low-fat group had significantly lower low- 
density lipoprotein (LDL) cholesterol and 
metabolic syndrome scores and no un- 
favorable changes to high-density lipo- 
protein (HDL) cholesterol or triglycerides 
relative to those of controls (39). Although 
the overall results of WHI-DM were neg- 
ative for CVD, follow-up showed that 
women without baseline hypertension 
had a 30% reduced CVD risk, whereas 
those with baseline hypertension or prior CVD
had no benefit or increased CVD risk; these
findings suggest that a low-fat diet might have a
greater effect on prevention than treatment (40).

Fig. 1. Pleiotropic effects of low-carbohydrate, high-fat diets. Ketogenic diets (aqua) may
enhance these effects and act through additional mechanisms. Abbreviations: βOHB, β-hydroxybutyrate;
HDAC, histone deacetylase; NAD+, nicotinamide adenine dinucleotide; mTOR, mechanistic target
of rapamycin.

In a meta-analysis of RCTs, addition of at 
least 3 g of oat β-glucan per day reduced total
and LDL cholesterol without unfavorable ef- 
fects on triglycerides or HDL cholesterol (41), 
highlighting the benefits of a low-fat, grain-based
diet. In another meta-analysis of
RCTs, low-fat diets lowered LDL cholesterol, a 
major CVD risk factor, whereas low-carbohydrate 
diets lowered triglycerides (42). 


Cancer 


Cancer includes more than 100 disease types and 
subtypes, precluding a comprehensive assess- 
ment of potential diet effects here, but several 
major trials provide useful evidence. In the low- 
fat diet arm of WHI-DM, there was no significant 
effect on total breast cancer incidence, but estro- 
gen receptor-positive, progesterone receptor- 
negative cancers were significantly reduced by 
36% over a mean of 8.1 years of follow-up (32). 
Among women who had higher baseline fat 
intake (>36.8% of energy), overall risk of breast 
cancer was significantly reduced by 22% over 
a median of 11.5 years. For these women, total 
and breast cancer deaths were reduced by 22% 
and 14%, respectively. However, a low-fat, high- 
carbohydrate intervention conducted in high- 
risk women had no significant effect on incidence 
of invasive breast cancer in another study with 
a mean 10-year follow-up (43). Breast cancer 
patients in the Women’s Intervention Nutrition 
Study randomly assigned to the low-fat diet group 
had a statistically significant 24% reduced risk of 
cancer relapse relative to controls over a median 
of 5 years (44). In another randomized trial among 
breast cancer patients with very low risk of re- 
currence, a low-fat, plant-based diet had no effect 
on recurrence or mortality (45). 

Specific types of fats may influence prostate
cancer risk, possibly as a result of effects on cell 
signaling and other cancer-related pathways. In 
the Prostate Cancer Prevention Trial and the 
Selenium and Vitamin E Cancer Prevention Trial, 
higher blood measures of omega-3 (N-3) fatty 
acids, particularly docosahexaenoic acid (DHA), 
were associated with increased risks of both total 
and high-grade prostate cancer (46, 47). These 
findings are consistent with a study in which 
prostate cancer patients were randomly assigned 
to flaxseed supplements [a rich source of the N-3 
fat α-linolenic acid (ALA)] or placebo (48). The
supplement led to increased tumor proliferation 
and higher prostate-specific antigen (PSA) at 
prostatectomy. However, the clinical implications 
remain unknown; research is needed to determine 
whether specific fatty acids should be reduced 
in people at risk for specific cancers. 


The case for a low-carbohydrate, 
high-fat diet 

Carbohydrate-restricted diets vary in macro- 
nutrient composition, but the defining feature 
is that contributions to total energy are reduced 
for carbohydrate and increased for fat (≥40% of
energy) relative to conventional diets. Emerging 
evidence suggests that a ketogenic diet—a special 
type of low-carbohydrate diet with fat typically 
>70% of energy—may have unique therapeutic 
effects beyond those of less restrictive regimens. 
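
Because these diets are defined by each macronutrient's share of total energy rather than by grams, it can help to see the conversion worked through. The sketch below is illustrative only: it assumes the general Atwater factors (4 kcal/g for carbohydrate and protein, 9 kcal/g for fat), which are not stated in this article, and the function name and the example intake are hypothetical.

```python
# Minimal sketch: convert daily macronutrient intake (grams) into
# shares of total energy (percent).
# Assumption (not from the article): general Atwater factors in kcal/g.
KCAL_PER_GRAM = {"carbohydrate": 4.0, "protein": 4.0, "fat": 9.0}


def energy_shares(grams):
    """Return each macronutrient's share of total energy, in percent."""
    kcal = {name: grams[name] * KCAL_PER_GRAM[name] for name in KCAL_PER_GRAM}
    total = sum(kcal.values())
    if total <= 0:
        raise ValueError("total energy must be positive")
    return {name: 100.0 * value / total for name, value in kcal.items()}


# Hypothetical daily intake in grams, chosen only for illustration.
example_day = {"carbohydrate": 40.0, "protein": 100.0, "fat": 160.0}
shares = energy_shares(example_day)
print(shares)
```

For this hypothetical intake (2000 kcal in total), fat supplies 72% of energy, protein 20%, and carbohydrate 8%, which would fall in the range the text describes as typical of a ketogenic diet (fat above 70% of energy).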


Physiological mechanism 


Conventional lifestyle recommendations and 
existing drug treatments have failed to stem the 
twin epidemics of obesity and type 2 diabetes. 
Nearly three-fourths of U.S. adults are overweight 
or obese, and half have prediabetes or diabetes, 
despite a 40-year focus on reducing dietary fat. 
The most salient change in macronutrient intake 
over this period has been a marked increase in 
processed starches and added sugars, which suggests
that they may have a role in the public
health crisis of diet-related chronic disease (49). 

As dietary carbohydrate is replaced by fat, 
postprandial spikes in the blood concentrations 
of glucose and insulin decrease, glucagon secre- 
tion increases, and metabolism shifts to a greater 
reliance on fat oxidation (Fig. 1). These metabolic 
and hormonal responses are associated with attenuated
oxidative stress and inflammatory responses
after eating (50, 51), reduced hormone
resistance [to insulin, leptin, fibroblast growth 
factor-21 (FGF-21), and thyroxine] (52, 53), and 
improvements in many features of metabolic 
syndrome (54-56)—effects that increase through- 
out the range of carbohydrate restriction. Ad- 
ditional mechanisms arise as carbohydrate is 
restricted to a point that results in nutritional 
ketosis, in which serum concentrations of β-hydroxybutyrate
increase from <0.1 mM to 0.5 to
5 mM. This normal physiological state differs from
diabetic ketoacidosis, in which β-hydroxybutyrate
concentrations exceed 10 mM. Ketones, an alternative
fuel used by the brain (57) and heart,
affect metabolic efficiency and a panoply of 
signaling functions, producing beneficial changes 
in gene expression, inflammation, oxidative stress, 
and possibly health span (58, 59). 

From a pathophysiological perspective, low- 
carbohydrate, high-fat diets may directly target 
underlying metabolic dysfunction in insulin re- 
sistance and type 2 diabetes, characterized by 
defects in the body’s ability to oxidize ingested 
carbohydrate. With insulin resistance, dietary car- 
bohydrate is diverted at increased rates into hepat- 
ic de novo lipogenesis, resulting in increased 
hepatic triglyceride synthesis and abnormal con- 
centrations of lipids in the blood (60). From a 
historical perspective, some aboriginal hunting 
and fishing cultures (e.g., Inuit of the Arctic and 
First Nations groups in Canada) survived for 
millennia with little available dietary carbohy- 
drate. In fact, mild ketosis was the “normal” meta- 
bolic state for many cultures before the advent 
of agriculture (i.e., for all but the last 1% or less 
of the existence of humans as a species). When 
these ethnic groups underwent a transition from 
their low-carbohydrate and high-fat traditional 
diets, the prevalence of obesity and type 2 diabetes 
increased markedly, although changes in other 
lifestyle factors may have also had a role. 


Obesity, type 2 diabetes, and 
cardiovascular disease 


The most recent systematic reviews and meta- 
analyses have concluded that carbohydrate- 
restricted diets tend to outperform low-fat diets 
for short- to medium-term weight loss, espe- 
cially in trials that involved a ketogenic diet 
(9, 10, 54, 61). Whereas individuals with insulin 
sensitivity seem to respond similarly to low-fat 
or low-carbohydrate diets, those with insulin 
resistance, glucose intolerance, or insulin hy- 
persecretion may lose more weight on a low- 
carbohydrate, high-fat diet (62, 63). The lower 
insulin concentrations and accelerated rates 
of adipose tissue lipolysis and ketogenesis may 
provide more stable metabolic fuel availability, 
especially for the brain, resulting in greater
satiety during weight loss; potential effects on 
energy expenditure remain a subject of inves- 
tigation (63). 

Metabolic syndrome—including central adi- 
posity, high circulating concentrations of trigly- 
cerides, low levels of HDL cholesterol, high blood 
pressure, glucose intolerance, fatty liver, and 
chronic inflammation—comprises a constellation 
of clinical risk factors associated with insulin 
resistance that predispose to diabetes and CVD. 
Reduction in dietary carbohydrate may improve 
these markers more effectively than do low-fat 
diets (54-56, 64). In an 8-week trial of patients 
with type 2 diabetes in Italy, a diet high in total 
(42% of energy) and monounsaturated (MUFA) 
fat decreased liver fat significantly more than did 
a low-fat (28% of energy), high-fiber diet (65). In 
a 2-year trial conducted at a worksite in Israel, 
participants in the low-carbohydrate diet group 
(fat approximately 40% of energy) lost more 
weight and experienced greater improvements 
in HDL cholesterol and triglycerides than did 
those in the low-fat diet group (fat approximately 
30% of energy) (66). With restriction of carbohydrate
to ketogenic levels (<50 g/day), individuals
with metabolic syndrome lost more
weight, total fat, and abdominal fat than
did those consuming a low-fat (24% of
energy), calorie-restricted diet (56). The
ketogenic diet also significantly decreased
serum triglycerides, increased HDL cholesterol
concentration, lowered inflammatory
markers, and reduced concentrations
of circulating saturated fatty acids (50),
consistent with metabolic benefits seen
in other studies (67).

Carbohydrate restriction in general,
and specifically a ketogenic diet, may
provide exceptional benefits in the setting
of diabetes, essentially a disease of
carbohydrate intolerance. Historically,
ketogenic diets were the treatment of
choice for diabetes, but the discovery
of insulin in the early 1920s allowed for
control of acute symptoms on higher-carbohydrate
diets. By the 1980s, low-fat
diets with up to 60% energy from
carbohydrate had become the standard
of care, although current recommendations
emphasize individualizing macronutrient
composition. However, despite
modern insulin analogs and glucose monitoring
technologies, management of diabetes
remains suboptimal. In a recent
survey, 316 children and adults with type 1
diabetes following a low-carbohydrate,
high-fat diet for a mean of >2 years reported
exceptional glycemic control, low
rates of complications, and excellent
metabolic health markers (12). Among
262 participants with type 2 diabetes
assigned to a ketogenic diet with intensive
telemedicine support, 83% completed
the 1-year intervention; in this group,
weight was reduced by 12%, hemoglobin
A1c (HbA1c, a measure of long-term
average glucose concentration) was reduced by
1.3%, and a majority had HbA1c levels of <6.5%
(i.e., below the diagnostic threshold for diabetes)
while taking no medications other than
metformin (11).

Low-carbohydrate diets are typically (but not
necessarily) high in saturated fat. As discussed
below, saturated fat is directly associated with
cardiovascular and total mortality in the general
population (although this relation has been
a subject of controversy, related in part to the
nature of the substituted calories) (68, 69). However,
with the higher rates of fatty acid oxidation
and decreased de novo lipogenesis on a ketogenic
diet, blood concentrations of saturated fatty acids
and palmitoleic acid (a marker of de novo lipogenesis)
may decrease (50, 55, 56), suggesting a
lower risk of diabetes and CVD. Furthermore,
any effects of increased LDL cholesterol (a risk
marker for CVD that occurs in about half of
individuals on a ketogenic diet) need to be considered
together with improvements in triglycerides,
HDL cholesterol, inflammatory markers,
and other features of metabolic syndrome. However,
there are no long-term studies tracking CVD
outcomes.


Cancer


Certain cancer cells rely on glycolysis for energy
metabolism. By decreasing glucose flux into tumor
cells, a ketogenic diet could target the defective
mitochondrial oxidative phosphorylation specific
to some cancers. Carbohydrate restriction
might also help to prevent or treat cancer by lowering
oxidative stress, inflammation, and cellular
signaling involving anabolic hormones such as
insulin (which is thought to mediate in part the
association between obesity and cancer risk)
(70, 71). Preclinical data involving various models
appear promising, including the use of a ketogenic
diet to enhance the effectiveness of phosphoinositide
3-kinase (PI3K) inhibitors in cancer treatment
(72). However, clinical reports are largely
limited to small case series, with no high-quality
RCTs.


Clinical translation


Moderately low-carbohydrate diets entail relatively
simple changes in diet, focused primarily
on substituting high-fat foods for processed carbohydrates
while allowing several daily servings
of whole fruits, legumes, and minimally processed
grains. A ketogenic diet may include various
nutrient-dense whole foods such as nonstarchy
vegetables, nuts, eggs, cheese, butter/cream,
fish, meats, oils, and select fruits.
Proper formulation of a ketogenic diet
entails restriction of carbohydrate, adequate
but not high intake of protein, and sufficient
sodium to offset the natriuretic
effect of ketosis and reduced insulinemia.
Recent data among motivated patients suggest
the possibility of good compliance
and improved quality of life through 1 year
(11), although safety has not been fully assessed
in long-term trials.


Box 2. Points of consensus.


1. With a focus on nutrient quality, good health and low
chronic disease risk can be achieved for many people on
diets with a broad range of carbohydrate-to-fat ratios.


2. Replacement of saturated fat with naturally occurring
unsaturated fats provides health benefits for the general
population. Industrially produced trans fats are harmful and
should be eliminated. The metabolism of saturated fat may differ
on carbohydrate-restricted diets, an issue that requires study.


3. Replacement of highly processed carbohydrates (including
refined grains, potato products, and free sugars) with
unprocessed carbohydrates (nonstarchy vegetables, whole
fruits, legumes, and whole or minimally processed grains)
provides health benefits.


4. Biological factors appear to influence responses to diets
of differing macronutrient composition. People with relatively
normal insulin sensitivity and β cell function may do
well on diets with a wide range of carbohydrate-to-fat
ratios; those with insulin resistance, hypersecretion of
insulin, or glucose intolerance may benefit from a lower-carbohydrate,
higher-fat diet.


5. A ketogenic diet may confer particular metabolic benefits
for some people with abnormal carbohydrate metabolism,
a possibility that requires long-term study.


6. Well-formulated low-carbohydrate, high-fat diets do
not require high intakes of protein or animal products.
Reduced carbohydrate consumption can be achieved by
substituting grains, starchy vegetables, and sugars with
nonhydrogenated plant oils, nuts, seeds, avocado, and
other high-fat plant foods.


7. There is broad agreement regarding the fundamental
components of a healthful diet that can serve to inform
policy, clinical management, and individual dietary choice.
Nonetheless, important questions relevant to the epidemics
of diet-related chronic disease remain. Greater investment in
nutrition research should assume a high priority.


The case for dietary fat quality 


At one time, dietary fat, primarily trigly- 
cerides, was considered simply a source 
of energy. However, the fatty acids in 
triglycerides can vary in chain length, 
number and position of double bonds, 
and whether the double bonds are in cis 
or trans configuration. These features 
profoundly affect the biological function 
of fatty acids, and thus their effects on 
health, in complex, incompletely understood
ways.

The position of double bonds, described 
by the number of carbons from the non- 
carboxyl end of the fatty acid to the first 
double bond, has particular importance. 
Two families of polyunsaturated fatty 
acids (PUFAs), the N-3 and N-6 fatty acids, 
are essential because they cannot be syn- 
thesized by humans. Both are critical com- 
ponents of every human cell membrane 
and are precursors of eicosanoid hormones 
that mediate inflammation, thrombosis, 
immunity, and insulin resistance. An in- 
crease in N-3 fatty acid intake alters expres- 
sion of more than 6000 genes, underscoring 
this biological complexity (73). 

A vast literature based on controlled feeding 
studies with physiologic endpoints, long-term 
epidemiologic studies, and randomized trials 
with clinical outcomes has documented that the 
type of dietary fat strongly influences human 
health independent of total fat intake. N-6 and 
N-3 fatty acids provide benefit at intakes above 
minimum levels to prevent essential fatty acid 
deficiency, and nonessential dietary fatty acids 
also have important metabolic effects. 


Obesity and diabetes 


Whereas the literature on total fat intake is ex- 
tensive, little is known about the effects of specific 
types of fat on weight control and body com- 
position. In a 7-week controlled overfeeding study, 
saturated fat increased hepatic and visceral fat 
storage relative to polyunsaturated fat (74). In 
a large cohort analysis (75), increases in the in- 
takes of trans and saturated fat were positively 
associated with weight gain when 
compared isocalorically with carbo- 
hydrate, but intakes of MUFA and 
PUFAs did not influence weight. To 
our knowledge, no RCTs lasting 1 year 
or longer have compared the effects of 
different types of fat on body weight. 
Consistent with the effects of trans
fat on multiple components of metabolic
syndrome (see below), higher
intake was associated with risk of type
2 diabetes in a large cohort study with
repeated measures of diet (76). In a
10-week randomized trial, consumption
of PUFA reduced biomarkers of insulin
resistance relative to consumption
of saturated fat (77). In a large cohort
study, the ratio of polyunsaturated
to saturated fat intake was inversely
associated with risk of type 2 diabetes
(76), and relative blood levels of linoleic
acid, which reflect intake, were inversely
associated with risk of type 2 diabetes
in a pooled analysis of 20 cohort
studies (78).


Cardiovascular disease


Early evidence on dietary fats and CVD was based
on comparisons of incidence and mortality rates
across geographical areas, and on knowledge of
the effects of dietary fats on blood cholesterol
levels. In the Seven Countries Study (79), per
capita intake of saturated fat, but not total fat,
was strongly correlated with rates of CVD; although
potentially confounded by other variables, this
provided a strong incentive to understand the
major geographical variation in CVD rates. In
controlled feeding studies lasting several weeks,
compared isocalorically to carbohydrate, saturated
fat increased blood cholesterol concentrations,
whereas PUFA reduced them (80, 81). Thus,
from the 1960s, dietary advice to reduce CVD
emphasized replacing saturated fat with PUFAs,
primarily N-6, and consumption of N-6 PUFA in
the United States increased from approximately
3% to 7% of energy. Concurrently, age-adjusted
coronary heart disease mortality decreased by
about 75%, although lower rates of tobacco use
and other prevention efforts (e.g., statins) contributed
to this secular trend.

In subsequent epidemiologic studies, blood
lipid subfractions predicted CVD better than did
total cholesterol; higher amounts of LDL cholesterol
and triglycerides are associated with higher
risk, whereas higher amounts of HDL cholesterol
predict lower risk (82). In further controlled feeding
studies, replacement of saturated fat with
carbohydrates reduced both LDL cholesterol and
HDL cholesterol and increased blood concentrations
of triglyceride during fasting, suggesting
little or potentially adverse effects on risk of
CVD. Replacement of monounsaturated or polyunsaturated
fat with carbohydrate increased LDL
cholesterol and had minimal effects on HDL
cholesterol or triglycerides.

Consistent with the controlled feeding studies
of blood lipids, in several randomized trials with
CVD as the outcome, replacement of saturated
fat with PUFA reduced the risk of CVD, whereas
replacement with carbohydrate did not (83); however,
these studies were small, short-term, and
had other limitations (e.g., a lack of emphasis on
carbohydrate quality). Long-term prospective
cohort studies are also consistent with these findings:
When compared isocalorically with saturated
fat, N-6 PUFAs (but not typical carbohydrates
in Western diets) are associated with lower risk
of CVD (84-86). Controlled for other types of fat,
MUFAs are also inversely associated with risk.
This inverse association with PUFA is linear up
to about 8% of energy, beyond which data are
sparse. These epidemiologic studies also highlight
the importance of carbohydrate quality;
relative to saturated fat, whole grains are associated
with lower CVD risk (87).

By the 1990s, the distinction between N-6 and
N-3 PUFAs and between cis and trans isomers
gained widespread recognition. In animals, N-3
fatty acids protect against cardiac arrhythmias,
and in epidemiologic studies, intakes of N-3 fatty
acids [DHA or eicosapentaenoic acid (EPA) from
fish and ALA from plant sources] are inversely
but nonlinearly associated with risk of sudden
cardiac death (88). Specifically, risk decreases
with intakes up to about 250 mg/day (equivalent
to one or two servings of fish per week) but then
plateaus. The inconsistent effects of supplements
seen in these RCTs may relate to the variability in
intakes within and among populations (intakes
among some individuals in the United States
and mean intakes in many countries remain very
low) (89). At high dosage, fish oil supplements
may reduce the risk of cardiovascular events
such as heart attack and stroke among people
with hypertriglyceridemia, according to preliminary
data from a large trial (90), a possibility
that warrants further study.

The main N-6 PUFA in diets, linoleic
acid, can be elongated and desaturated
to form eicosanoids that are prothrombotic
and proinflammatory. In addition,
linoleic acid may competitively
inhibit biosynthetic pathways shared
by the N-3 fatty acid ALA in the formation
of antithrombotic and anti-inflammatory
eicosanoids. For these
reasons, some have concluded that
higher N-6 fatty acid intake should be
minimized to prevent CVD and other
diseases associated with chronic inflammation.
However, this reasoning
disregards evidence that N-6 PUFA
intermediates in these pathways, such
as arachidonic acid, are highly regulated
(91). Although very high intakes
of N-6 PUFA increase inflammatory
measures in some animal models, this
effect has not been convincingly demonstrated
in humans (92); higher intake
of linoleic acid in humans may
actually have anti-inflammatory effects
(93). Moreover, the ratio of N-6 to N-3
fatty acids has not been associated
with risk of CVD, consistent with both
being beneficial (94). Nonetheless, special effects
in subgroups or at very low intakes of carbohydrate
cannot be ruled out.

[Figure 2. Vertical axis: change in total mortality, %. Horizontal axis: increment of energy
from specific type of fat, %. Curves: trans fat, saturated fat, monounsaturated fat,
polyunsaturated fat. Figure 2 reprinted with permission of the American Medical Association.]

Fig. 2. Relation of increasing intakes of trans, saturated,
unsaturated, monounsaturated, and polyunsaturated fatty acid
(compared isocalorically with carbohydrate) to total
mortality. Data are based on 126,233 men and women followed
for up to 32 years, with assessments every 4 years, as described in
Wang et al. (94). The strong inverse association with polyunsaturated
fatty acids was primarily due to N-6 polyunsaturated fatty acids;
associations with N-3 polyunsaturated fatty acids were weaker.

The process of partial hydrogenation, which 
creates trans isomers from the natural cis double 
bonds of fatty acids, was widely used to create 
margarine and vegetable shortening with favor- 
able commercial properties (solidity at room tem- 
perature, long shelf life). This industrial process 
altered the structure and function of linoleic acid 
and ALA, the dominant fatty acids in many widely 
used oils, resulting in major health impacts. Trans 
fat has uniquely adverse effects on LDL, LDL par- 
ticle size, HDL, triglycerides, and inflammatory 
factors (95). In multiple large-cohort studies, in- 
take of trans fat is directly associated with risk of 
coronary heart disease and other chronic illnesses. 
Through regulations, education, and food label- 
ing, trans fat was largely eliminated from the food 
supply in the United States and some European 
countries. However, intake remains high in some
parts of the world. 


Cancer 


Mechanistic studies have suggested that both 
N-6 and N-3 fatty acids could either increase or 
reduce cancer risk (46-48), and some animal 
studies have suggested that intakes of PUFA 
beyond the range of typical human diets might 
increase risks. In human studies, consumption
of these fatty acids and other specific types of fat
during midlife does not have consistent relationships
to risks of various cancers, according to
biomarkers of intake and assessments of diet 
(96). Higher intake of fat from animal sources, 
but not vegetable sources, during early adult life 
was associated with higher risk of breast cancer, 
which may reflect the type of fat or nonlipid factors 
(97). Because of long latencies and windows of 
vulnerability for carcinogenic influences, further 
studies of specific types of fat across the lifespan 
are desirable. 


Other outcomes 


Adequate intake of both N-6 and N-3 fatty acids 
in utero and during early life is critical for neuro- 
logical development because these fatty acids 
constitute much of the lipid in the central nervous 
system. Low consumption of fish, the primary 
source of DHA and EPA, during pregnancy is as- 
sociated with lower cognitive function and preterm 
birth (98, 99). In later life, lower consumption of 
N-3 fatty acids and higher consumption of trans 
fats have been associated with greater risk of 
dementia (100).

In a recent prospective study, 126,233 men and
women were followed for up to 32 years, with 
diet assessed every 4 years (94). Compared iso- 
calorically to carbohydrate intake, intake of trans 
fat was strongly associated with higher mortality. 
Intakes of MUFA and N-3 PUFA were weakly 
associated with lower mortality, and intake of 
N-6 PUFA was strongly associated with lower 
mortality (Fig. 2). Because of reductions in sat- 
urated and trans fats over the study period, 
total fat intake was inversely associated with 
mortality. 


Conclusion 


The optimal proportion of carbohydrate to fat 
in the diet for obesity treatment and chronic 
disease prevention has been a topic of debate 
for decades, often generating more heat than 
light (101). Of course, any meaningful assessment
of a diet’s impact on health must extend far beyond 
macronutrient quantity, to include the myriad 
qualitative aspects of food and food combina- 
tions that influence hormonal response, gene 
expression, and metabolic pathways. Further 
complicating this issue is the likelihood that 
inherent or acquired biological differences among 
individuals or populations, especially related to 
glucose homeostasis, affect response to specific 
diets. 

Unfortunately, the national nutrition research 
agenda has not been adequate to address impor- 
tant areas of controversy (Box 1). Currently, the 
United States invests a fraction of a cent on nutrition
research for each dollar spent on treatment
of diet-related chronic disease. All too often, 
scientific results in this field have been ambig- 
uous: Macronutrient feeding studies have been 
too short and too small to distinguish transient 
from chronic effects; many behavioral trials have 
lacked the intensity to produce meaningful differ- 
ences between dietary treatment groups; and 
observational studies can be affected by con- 
founding, inability to distinguish cause and effect, 
and other methodological problems. Furthermore, 
despite promising preliminary data, few major 
studies of a ketogenic diet in the treatment of di- 
abetes have been conducted. Additional questions 
related to sustainability for the individual (whether 
people can realistically remain on prescribed diets) 
and for the environment (the impacts of specific 
dietary patterns on natural resources and climate 
change) require more study. Given the enormous 
human and economic toll of diet-related disease, 
high-quality research into key controversies should 
be given priority. 

The incomplete nature of research notwith- 
standing, data from multiple lines of investiga- 
tion have led to important areas of consensus 
(Box 2). Current evidence indicates that no 
specific carbohydrate-to-fat ratio in the diet is 
best for the general population. Nor do all diets, 
and calorie sources, have similar metabolic effects 
in everyone. With attention to diet quality—and 
specifically a focus on reducing processed foods, 
including sugar and refined grains—many peo- 
ple do relatively well with substantial variation 
in macronutrient composition (102). For the rapid- 
ly rising proportion of the population with severe 
metabolic dysfunction or diabetes, a more specific 
dietary prescription may be needed. 


A time to fast


Andrea Di Francesco, Clara Di Germanio, Michel Bernier, Rafael de Cabo* 


Nutrient composition and caloric intake have traditionally been used to devise optimized 
diets for various phases of life. Adjustments in meal size and frequency have emerged
as powerful tools to ameliorate and postpone the onset of disease and delay aging,
whereas periods of fasting, with or without reduced energy intake, can have profound 
health benefits. The underlying physiological processes involve periodic shifts of metabolic 
fuel sources, promotion of repair mechanisms, and the optimization of energy utilization 
for cellular and organismal health. Future research endeavors should be directed to the 
integration of a balanced nutritious diet with controlled meal size and patterns and 
periods of fasting to develop better strategies to prevent, postpone, and treat the 
socioeconomical burden of chronic diseases associated with aging. 


The worldwide increase in life expectancy 
has not been paralleled by an equivalent 
increase in healthy aging. Developed and 
developing countries are facing social and 
economic challenges caused by dispropor- 
tional increases in their elderly populations and 
the accompanying burden of chronic diseases (2). 
Geriatricians and gerontologists have contributed 
greatly to our understanding of the consequences 
and processes that underlie aging from clinical, 
social, mental, physical, and biological perspec- 
tives. The primary goal of aging research is to 
improve the health of older persons and to de- 
sign and test interventions that may prevent or 
delay age-related diseases. Besides socioeconomic 
status, energy, environmental quality, and genet- 
ics are the most powerful determinants of health 
and longevity. Although environmental quality 
and genetics are not under our direct control, 
energy intake is. The consumption of food pro- 
vides energy and nutrients necessary to sustain 
life and allows growth, repair, and reproduction. 
Proper nutrition can influence health and survival 
and delay or, in some cases, prevent the onset and 
progression of chronic diseases. However, both 
hypo- and hypernutrition have the potential to 
increase the risk of chronic disease and pre- 
mature death. Furthermore, manipulation of a 
nutritionally balanced diet, whether by altering 
caloric intake or meal timing, can lead to a delay 
of the onset and progression of diseases and to 
a healthier and longer life in most organisms 
(2-4). In general, both prolonged reduction in 
daily caloric intake and periodic fasting cycles 
have the power to delay the onset of disease and 
increase longevity. Data from experimental studies 
in short-lived species and emerging clinical and 
epidemiological observations indicate that die- 
tary interventions are valuable strategies that 
can be applied to promote healthy aging. In mod- 
el organisms, caloric restriction (CR) provides 
beneficial effects on health and survival, and 
there is an extensive literature that provides in- 
sights into its molecular mechanisms of action 
(4). However, chronic CR has been reported to 
exert adverse effects for a number of mouse 
strains (5) and could push humans to what one 
may consider a near anorexic state (6), underlying 
some cautionary negative outcomes pointed out 
by the eating-disorders field as well (7-9). 

An emerging area of research is the investi- 
gation of the independent consequences of var- 
iations in meal size (through the control of energy 
intake) and meal frequency (by controlling the 
time of feeding and fasting) on the incidence or 
amelioration of multiple age-related diseases, 
including cardiovascular disease, diabetes, cancer, 
and dementia (2, 10). These studies are starting to 
reveal that health-span and life-span extension 
can be achieved by interventions that do not re- 
quire an overall reduction in caloric intake. Ma- 
jor challenges for the future include the design of 
well-controlled and randomized clinical trials 
to test whether these observations can be trans- 
lated to humans, determination of the impor- 
tance of individual genetic variant medicine, and 
implementation of health care policies in which 
new eating regimens can be integrated into clin- 
ical practice. We discuss four experimental strat- 
egies aimed at altering energy intake or the 
duration of fasting and feeding periods that re- 
sult in improved aspects of health in mammals. 
These are (i) classical CR, in which daily caloric 
intake is typically decreased by 15 to 40%; (ii) time- 
restricted feeding (TRF), which limits daily intake 
of food to a 4- to 12-hour window; (iii) intermittent 
or periodic full or partial fasting, that is, a periodic, 
full- or multiday decrease in food intake; and 
(iv) fasting-mimicking diets (FMDs) that use a 
strategy to maintain a physiological fasting-like 
state by reducing caloric intake and modifying 
diet composition but not necessarily fasting 
(Fig. 1). We also summarize the metabolic and 
cellular responses triggered by these feeding reg- 
imens and their impact on physiology, focusing 
on studies in rodents, monkeys, and humans. 
Finally, we discuss how these studies may pro- 
vide a basis for future investigations on the role 
of various dietary interventions in the prevention 
of diseases and promotion of health throughout 
the life span. 
Caloric restriction 

Early observations linking reduced food intake 
to improvements in health and survival are now 
a century old. Rous discovered that limiting food 
intake had an impact on cancer development 
(11), and Osborne et al. reported growth retar- 
dation and life-span extension by a reduction 
in food intake (12). McCay and colleagues later 
published a seminal paper showing that rats 
fed a limited amount of food lived much longer 
than their ad libitum (AL)-fed littermates (13). 
Nearly a century after these initial studies, the 
positive impact of CR on health span and life 
span has been documented in many model or- 
ganisms that include unicellular yeast, nematode 
worms, fruit flies, mice, and primates, suggesting 
a strong evolutionary conservation of common 
mechanisms connecting food intake to longev- 
ity (4). CR interventions oppose several age- 
associated pathophysiological changes, including 
reduction of metabolic rate and oxidative dam- 
age (14), enhanced cellular turnover and protein 
homeostasis, and improvement of age-related 
metabolic disorders that include central obesity, 
insulin resistance, dyslipidemia, and hypertension. 
Studies seeking to unravel the link between me- 
tabolism and aging have demonstrated that CR 
induces profound metabolic and molecular 
changes in components of the nutrient-sensing 
and stress-responsive pathways, such as growth 
hormone, insulin and insulin-like growth factor 
(IGF) signaling, mechanistic target of rapamycin 
(mTOR), adenosine 5'-monophosphate-activated 
protein kinase (AMPK), forkhead box protein O 
(FOXO), sirtuins, and nuclear factor erythroid 
2-related factor 2 (NRF2) (15, 16) (Fig. 2). The 
identification of longevity-regulatory pathways 
led to studies of pharmacological interventions 
with U.S. Food and Drug Administration-approved 
drugs (for example, rapalogs and metformin) and 
other naturally occurring compounds (such as 
resveratrol and spermidine) that mimic changes 
observed in CR without the requirement of changes 
in food intake (17, 18). We focus on the most rel- 
evant molecular and metabolic alterations elicited 
by dietary interventions that involve a reduction 
in caloric intake and fasting periods. 

Numerous strides have been made to under- 
stand how the number of calories, diet compo- 
sition, and timing of the intervention can be 
manipulated to effectively extend longevity and 
to translate these observations into feeding par- 
adigms that can be applied to humans, with the 
goal of delaying the onset of age-associated con- 
ditions and promoting healthy aging. CR without 
malnutrition can be accomplished by chronically 
reducing energy intake by 15 to 40% from AL 
conditions, while maintaining adequate intake 
of vitamins and minerals. In rodents, under cer- 
tain circumstances, CR can extend life span by 
up to 50% (19). Traditionally, diet composition 
and genetic background have been thought to 
have a marginal impact on life-span extension 
elicited by CR. More recently, studies in mice 
and nonhuman primates have revealed that the 
effect of CR on life-span extension is not uni- 
versal (20). Indeed, certain mouse recombinant 
Fig. 1. Experimental approaches used to improve fitness and promote health span. Description of different feeding regimens, their macronutrient 
balance, and feeding time during a 24-hour period. The feeding time represented in the day-night diagrams refers to eating time in humans and 
nonhuman primates. Mice undergoing a CR regimen tend to finish their food allotment quickly, self-imposing a continuous form of TRF. In human studies, 
people voluntarily reduce their food intake and mostly adhere to a three-meals-per-day schedule. Mean life-span extension is documented for all the 
treatments in the species depicted in the box, whereas maximum life span is only achieved after CR and IF or PF. 


inbred strains show either little increase or del- 
eterious effects on life span after CR (5). Analysis 
of body composition revealed that the best out- 
comes on survival were obtained in mice that 
preserved their fat stores during the second year 
of life, suggesting the necessity of a minimum 
level of adiposity for the full benefit of CR (21, 22). 
In CR regimens, sex, age, and genetic background 
contribute to outcomes regarding health and sur- 
vival in mice (22), and this may also be true for 
long-lived organisms, including humans. Data 
from two independent nonhuman primate studies, 
one at the National Institute on Aging (NIA) 
and the other at the University of Wisconsin- 
Madison (UW), challenged the association be- 
tween life-span extension and health span by 
reporting similar improvements in health but 
contrasting survival benefits in response to CR 
(23, 24). Possible explanations for the divergent 
outcomes emanating from these two studies have 
revealed important differences in genetic back- 
ground, onset of the intervention, feeding prac- 
tices, and diet composition (25). 
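
The 15 to 40% reduction from AL intake described above is simple arithmetic, and the following minimal sketch makes it concrete. The function name and the 2500 kcal/day baseline are illustrative assumptions, not values from the studies cited here.

```python
def cr_daily_intake(al_kcal: float, restriction: float) -> float:
    """Daily energy intake under CR, given an ad libitum (AL) baseline.

    `restriction` is the fractional reduction from AL intake. The text
    describes classical CR as a 15 to 40% reduction (0.15 to 0.40).
    """
    if al_kcal <= 0:
        raise ValueError("AL intake must be positive")
    if not 0.15 <= restriction <= 0.40:
        raise ValueError("restriction is outside the 15-40% range described for CR")
    return al_kcal * (1.0 - restriction)


# Hypothetical AL baseline of 2500 kcal/day (illustrative only).
for r in (0.15, 0.30, 0.40):
    print(f"{r:.0%} CR: {cr_daily_intake(2500, r):.0f} kcal/day")
```

The sketch covers energy only. CR without malnutrition also requires adequate intake of vitamins and minerals, which this calculation does not model.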

In humans, short-term trials such as the multi- 
center CALERIE (Comprehensive Assessment of 
Long-Term Effects of Reducing Intake of Energy) 
study (26-29), the observational studies of cente- 
narians residing in Okinawa who have been ex- 
posed to CR for most of their lives (30), and 
observations of the members of the Calorie Re- 
striction Society (CRONies) who self-impose CR 
(31) have shown the occurrence of many of the 
same physiological, metabolic, and molecular ben- 
efits typically associated with long-lived animals 
on CR. These studies support the observation 
that long-term CR preserves a more youthful 
functionality by improving several markers of 
health, including decreases in body weight, meta- 
bolic rate, and oxidative damage (14); lower in- 
cidence of cardiovascular disease (31) and cancer; 
and decreased activity of the insulin-Akt-FOXO 
signaling pathway (32, 33) (Fig. 2). 

Di Francesco et al., Science 362, 770-775 (2018) 

Although these findings clearly indicate that 
a reduction of caloric intake could be an ef- 
fective intervention to improve health and pre- 
vent disease during aging in humans, there are 
several obstacles that halt the transition from 
experimental studies into standard medical prac- 
tice: (i) the lack of clinical data supporting con- 
sistent effects of CR in older populations and 
the incomplete understanding of the age-specific 
effects of these interventions (4, 10); (ii) safety 
concerns related to lack of reserve capacity upon 
exposure to infection (34), injury, or surgery 
(35) and about bone thinning that could lead 
to the development of osteoporosis in older in- 
dividuals (36); (iii) the difficulty of compliance 
to extreme restriction; and (iv) the interindividual 
variability in body mass, especially lean mass, 
which strongly correlates with frailty (23). The 
current “obesogenic” social environment makes 
it difficult for individuals to adhere to strict 
dietary regimens and lifestyle modifications for 
long periods of time. Thus, there is interest in 
alternative feeding regimens that may recapit- 
ulate at least some of the beneficial effects of CR 
by controlling feeding-fasting patterns with little 
or no reduction in caloric intake. 


Time-restricted feeding 


Recent evidence indicates that the benefits of 
CR may not be entirely related to a reduction 
in calories. In many experimental models of 
CR, the reduction in energy intake encourages 
the animals to consume their entire daily food al- 
lowance in a very short interval, thus promot- 
ing a longer fasting period than when consuming 
standard or hypercaloric diets AL (37). Although 
(nocturnal) rodents with free access to food eat 
predominantly at night, they also tend to feed 
during the day, which correlates with gains in 
body weight (37). These observations raise the 
question of whether the timing of food con- 
sumption (either feeding duration or circadian 
timing) is a determinant of metabolic health, 
independent of total caloric intake and quality 
of calories. Thus, it is possible that triggering 
the fasting response on a daily basis or at spe- 
cific times is in itself beneficial. This would ex- 
plain why dietary dilution, a form of CR in which 
mice eat all day to compensate for the low 
density of energy in their diet, does not result 
in life-span extension (38, 39). Hence, chronic CR 
may improve health, at least in part, through an 
extended period of fasting. 

TRF refers to daily limitations in the timing 
of food intake, spanning from 4 to 12 hours, 
without reduction in caloric intake (Fig. 1). Al- 
though data on the effects of TRF on longevity 
are not yet available, studies in rodents have 
shown that TRF can confer protection against 
several detrimental metabolic consequences of 
a typical western diet (high fat and high carbo- 
hydrates, particularly refined sugars) through 
Fig. 2. Fasting time and energy restriction share biological responses implicated in metabolite- 
controlled longevity pathways. Reduction of calories by continuous energy restriction or prolonged 
fasting periods triggers metabolic adaptations characterized by increased amounts of circulating 
ketones, whereas circulating fatty acids, amino acids, glucose, and insulin are maintained at 
low concentrations. Adaptive cellular responses involve alterations in the ratios of adenosine 
monophosphate (AMP) to adenosine triphosphate (ATP), of oxidized nicotinamide adenine 
dinucleotide (NAD+) to the reduced form NADH, and of acetyl-CoA to CoA. After a few hours of 
fasting, increased AMP to ATP ratios activate AMPK, which triggers repair and inhibits anabolic 
processes. Acetyl-CoA and NAD+ serve as cofactors for epigenetic modifiers such as histone 
acetyltransferases and NAD+-dependent deacetylases, the sirtuins, thus linking nutrition, energy 
metabolism, and post-translational modifications of histone proteins. Sirtuins deacetylate FOXOs 
and peroxisome proliferator-activated receptor γ coactivator 1α (PGC-1α), factors respectively 
involved in stress resistance and mitochondrial biogenesis. Production of ketone bodies such as 
β-hydroxybutyrate from fatty-acid catabolism may operate as endogenous histone deacetylase 
(HDAC) inhibitors and may contribute to epigenetic control of gene expression, DNA repair, and 
genome stability. Ketogenesis also promotes synaptic plasticity and neurogenesis by increasing the 
expression of brain-derived neurotrophic factor (BDNF). Periodic cycles of fasting have systemic 
anti-inflammatory effects and increase progenitor stem cells. Down-regulation of the insulin-IGF-1 
signaling (IIS) pathway and reduction of circulating amino acids repress the activity of mTOR and 
its downstream effector, the ribosomal protein S6 kinase beta-1 (S6K). This mechanism inhibits 
global protein synthesis and promotes recycling of macromolecules by stimulation of autophagy. CR 
promotes the expression and activity of NRF2, which induces a number of antioxidative and 
carcinogen-detoxifying enzymes. Collectively, the organism responds to a low-energy challenge by 
minimizing anabolic processes (synthesis, growth, and reproduction), favoring maintenance systems, 
and enhancing stress resistance, tissue repair, and recycling of damaged molecules. Improvement in 
resilience, metabolic homeostasis, tissue repair, and organismal function can act as direct modifiers of the 
four domains of the aging phenotype: body composition (i); balance between energy availability and 
energy demand (ii); signaling networks that maintain homeostasis (iii); and neurodegeneration (iv). Each 
of these domains can be assessed readily by routine clinical tests. 
reduction in body weight, increase in energy 
expenditure, improved glycemic control and 
lower insulin levels, decrease in hepatic fat and 
hyperlipidemia, and attenuated inflammatory 
outcomes, even when food intake or body weight 
or both are matched to the control group (40-43). 
The molecular mechanisms responsible for the 
effects of altered meal patterns on metabolic 
health appear to be related, at least in part, to 
the synchronization between the time of fasting- 
feeding and the circadian rhythm (3) (Fig. 3). 
The circadian clock provides a conserved mech- 
anism that allows organisms to anticipate and 
respond to environmental changes. This per- 
petual rhythm leads to the timely expression of 
clock-controlled genes, especially those encom- 
passing enzymes and regulatory molecules that 
mediate physiological and metabolic functions. 
A strong relation exists between the circadian 
clock and metabolism, as they share some com- 
mon regulators. Indeed, TRF can restore cycling 
of metabolic regulators, such as nicotinamide 
phosphoribosyltransferase (NAMPT), cAMP re- 
sponse element-binding protein, mTOR, AMPK, 
or the insulin signaling pathway, all of which 
take part in the life-span and health-span ben- 
efits of CR (40) (Fig. 3). 

The NIA and UW CR monkey studies showed 
that the genetic background, age of onset of the 
intervention, and diet composition per se were 
not sufficient to explain the differences observed 
in longevity under both control diet and CR. 
Upon completion of the studies, the two research 
teams came to realize the notable differences in 
the feeding regimen (25), whereby UW monkeys 
were fed in the morning and the food was re- 
moved in the afternoon when another small 
treat, such as a piece of fruit, was offered. This 
protocol caused the animals to eat during the 
day and fast overnight. By contrast, NIA animals 
were fed twice daily, without removal of the sec- 
ond meal, thus virtually excluding the possibility 
of an overnight fast. To further shed light on the 
interaction between diet composition and eating 
patterns in a genetically homogeneous animal 
model, we recently compared the survival of mice 
fed the same diets used in the two nonhuman 
primate studies (NIA and UW) under AL, 30% 
CR, or a daily single meal feeding (MF) to match 
the calories consumed by the AL animals. Al- 
though both CR and MF mice showed increased 
life span compared with the AL groups, the ef- 
fect was independent of the diet composition. 
Both the CR and MF mice self-imposed a TRF 
paradigm, and the life-span and health-span 
extension seen in those groups appeared to be 
directly proportional to the time spent fasting 
(44). Similar behavior was reported in mice 
under CR, which voluntarily adopted a TRF par- 
adigm, as measured by an automated system 
that recorded time of food availability and 
consumption (37). 

Outcomes from TRF trials in humans also ap- 
pear to depend on the distribution of meals dur- 
ing the day and the duration of fasting (45-49). 
Limiting food intake to the middle of the day 
decreased body weight or body fat, fasting glucose 
and insulin levels, insulin resistance, hyperlip- 
idemia, and inflammation and produced mild 
caloric restriction and weight loss, without calorie 
counting (46, 47, 50, 51). Similarly, metabolic 
markers were improved in a group of people 
eating an isocaloric diet with a bigger breakfast 
and a smaller dinner (52, 53), and type 2 diabetic 
patients under hypocaloric diet obtained better 
metabolic outcome by eating most of their daily 
allotment in the first half of the day rather than 
divided into six meals throughout the day (54). 
On the contrary, restricting food intake to the 
late afternoon or evening either produced mostly 
null results or worsened glucose levels after eat- 
ing, β cell responsiveness, blood pressure, and 
lipid levels (45, 48, 49). A strictly controlled 
feeding trial tested prediabetic subjects that 
were allowed to eat their meals in either a 6-hour 
time window in the morning (before 3 p.m.) or 
a 12-hour time window for 5 weeks. Early TRF 
ameliorated the metabolic markers of diabetes 
without a significant reduction of body weight 
(55). Individuals on a hypocaloric, three-meal- 
per-day diet lost more weight when the major- 
ity of the food was consumed in the morning, as 
opposed to the evening (51). However, no signif- 
icant changes in glycemia, insulin sensitivity, or 
respiratory exchange ratio (RER, defined as the 
ratio between the amount of carbon dioxide pro- 
duced and oxygen used during breathing) were 
observed when obese, insulin-resistant men were 
exposed to a hypocaloric diet with food provided 
in the morning (56). In the context of cancer, two 
studies found that a fasting period of more than 
13 hours resulted in lower risk of breast cancer 
recurrence than that in subjects who fasted less 
than 13 hours (57, 58). The discrepancies remain 
to be explained, but collectively, given the pre- 
sent body of knowledge, these studies indicate 
that both the amount of time spent eating during 
each day and the time at which food is consumed 
relative to the circadian rhythm are critically 
important to the effects of diet on health and 
longevity. 
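
The RER defined above is a plain ratio of carbon dioxide produced to oxygen consumed. The minimal sketch below computes it. The function name and the gas-exchange values are illustrative assumptions rather than measurements from any cited study.

```python
def respiratory_exchange_ratio(vco2: float, vo2: float) -> float:
    """RER = CO2 produced / O2 consumed, both in the same units (e.g. L/min)."""
    if vo2 <= 0:
        raise ValueError("O2 consumption must be positive")
    if vco2 < 0:
        raise ValueError("CO2 production cannot be negative")
    return vco2 / vo2


# Hypothetical gas-exchange readings in L/min (illustrative only).
rer = respiratory_exchange_ratio(vco2=0.27, vo2=0.30)
print(f"RER = {rer:.2f}")
```

As the Fig. 3 caption describes, a higher RER reflects greater use of carbohydrates and a lower RER reflects greater use of lipids.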


Intermittent and periodic fasting 


An increasingly popular alternative to both con- 
tinuous CR and TRF is intermittent fasting (IF), 
an eating pattern in which no or few calories 
are consumed for periods of time that range 
from one to several days, followed by AL feeding 
on the remaining days (10) (Fig. 1). One example 
of IF is the 24-hour water fast without solid food 
followed by a normal feeding period of 24 hours. 
This alternate-day fasting differs somewhat from 
the alternate-day modified fasting in which 
participants consume very few calories one day 
(e.g., 25% of usual intake) followed by a day with- 
out restrictions. 
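
The two alternate-day patterns described above can be compared by their average intake over a 2-day cycle. The sketch below is a simplification that assumes the unrestricted day equals usual intake (a fraction of 1.0). That assumption is ours: AL feeding may exceed usual intake, and the function name is hypothetical.

```python
def mean_relative_intake(fast_day_fraction: float,
                         feed_day_fraction: float = 1.0) -> float:
    """Average intake over one fast/feed cycle, as a fraction of usual intake.

    Assumption (not from the article): the feeding day defaults to exactly
    the usual intake, i.e. a fraction of 1.0.
    """
    if fast_day_fraction < 0 or feed_day_fraction < 0:
        raise ValueError("intake fractions cannot be negative")
    return (fast_day_fraction + feed_day_fraction) / 2


# Alternate-day water fast: no calories on the fasting day.
print(mean_relative_intake(0.0))
# Alternate-day modified fasting: about 25% of usual intake on the fasting day.
print(mean_relative_intake(0.25))
```

Under this assumption the water fast averages half of usual intake and the modified fast averages 62.5%. Any compensatory overeating on feeding days would raise both figures.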

Both CR and fasting promote stress resist- 
ance in model organisms ranging from uni- 
cellular yeast to mammals, presumably by shifting 
energy from growth and reproduction to mainte- 
nance, recycling, and repair in order to increase 
cellular protection and survival (Fig. 2). There is 
an abundance of data that supports this hy- 
pothesis (59). From an evolutionary perspective, 
fasting is a natural phenomenon to which both 
humans and lower organisms were regularly ex- 
posed. Although many animals in the wild still 
encounter prolonged periods of time with little 
or no food, humans have rapidly transitioned into 
a sedentary lifestyle accompanied by a continuous 
and abundant supply of food. In a bygone era, the 
postabsorptive, or fasting, state triggered hunger 
and food-seeking behavior that took hours, some- 
times days, to satisfy. Mechanisms to survive such 
periods of fasting are believed to have pleiotropic 
benefits. However, in our present-day civiliza- 
tion, hunger and food-seeking behavior result 
in instant gratification and alterations in eating 
patterns characterized by the consumption of 
high-energy meals as soon as the urge arises, 
thus negating the putative benefits that periods 
without food provide. This behavior might be a 
major contributor to the emergence of the obesity 
epidemic and obesity-related morbidities (2). 

In rodents, IF extends life span (60) and pro- 
tects against obesity, cardiovascular disease, 
Fig. 3. Integration of the circadian rhythms and feeding-fasting cycles with metabolism. 
(Left) The transcriptional activators BMAL1 and CLOCK are at the core of a cell-autonomous 
molecular circuit that governs circadian rhythms. Fasting increases hunger, the extent of which 
depends on the overall energy intake, diet composition, and length of fasting. The internal circadian 
clock also increases hunger independent of food intake and other behaviors. Intermittent energy 
restriction increases concentrations of the plasma membrane redox system enzymes, NADH- 
cytochrome b5 reductase and NAD(P)H-quinone oxidoreductase, contributing to oscillations in the 
NAD(P)H [reduced form of NAD(P)+] to NAD(P)+ (nicotinamide adenine dinucleotide phosphate) 
ratio (82). The circadian rhythmicity of CLOCK and BMAL1 expression regulates the transcription of 
NAMPT, a key regulatory enzyme involved in the generation of NAD+, a metabolite required 
for the deacetylase activity of SIRT1. Active SIRT1 influences metabolism through its effects on 
catabolic and anabolic reactions and mediates BMAL1 deacetylation, which inhibits the circadian 
clock machinery. During the active phase (yellow boxes), increased production of ATP 
sustains anabolic pathways. During the resting phase (green boxes), a shift toward AMP gears 
metabolism toward catabolic processes. These intermediate energy carriers activate downstream 
transcription factors, kinases, and deacetylases like NRF2, AMPK, PGC-1α, sirtuins, and FOXOs, 
whose activation influences health and survival. (Right) At the organismal level, fasting or 
feeding states are paralleled by changes in the metabolic rate. AL-fed animals set their RERs at 
around 0.9, showing an intermediate preference between fat and carbohydrate metabolism. 
Both CR and TRF regimens increase the amplitude of RER oscillations, characterized by 
higher RER (utilization of carbohydrates) during feeding and lower RER (utilization of lipids) 
during fasting. Under prolonged fasting, lipids are the only source of energy, as opposed 
to feeding time. The FMD diet results in lower RER with a slight peak after the meal. The RER 
traces are idealized and may be close to what is seen in nocturnal rodents. 
Fig. 4. Systemic effects of caloric restriction or intermittent fasting. The balance between 
reduction in total food intake and timing contributes to differences in energy consumption, leading to 
changes in circulating factors and organ function. The height of each arrow does not reflect the 
intensity level but instead highlights a grouping. Down arrows indicate decreased levels, and up 
arrows indicate increased levels. 


hypertension, diabetes, and neurodegenerative 
diseases (2). It also retards the growth of tumors 
(61) and sensitizes a range of cancer cell types to 
chemotherapy (62). Additional benefits of IF 
include improvement in insulin sensitivity, in- 
dependently of both total food intake and weight 
loss (63), and enhancement in brain function as 
evidenced by better performance on behavioral 
tests that assess motor responses to sensory 
stimuli (64). The behavioral responses to IF are 
associated with increased synaptic plasticity and 
increased production of new neurons from neu- 
ral stem cells (2). Similar results were achieved 
when animals were fed a ketogenic diet (65) that 
is composed almost exclusively of fat. In mice, a 
ketogenic diet improved health span and delayed 
age-associated neurological decline, without an 
effect on longevity (66, 67). In the study of Verdin 
and colleagues, feeding a ketogenic diet to mice 
caused weight gain without life-span extension 
when compared to the controls, but a cyclic ke- 
togenic diet reduced midlife mortality, whereas 
in the second study, mice consuming a ketogenic 
diet isocaloric to that of the control group had 
increased life span despite having similar body 
weight (66, 67). The benefits of fasting may be 
mediated by a highly conserved stress-response 
or nutrient-sensing pathway, whereby organis- 
mal metabolism switches from storage to mobi- 
lization, in part, by increasing autophagy and 
recycling at the cellular level. The ensuing pro- 
duction and utilization of fatty acid-derived ke- 
tones serve to preserve the brain and muscle 
function and allow the organism to withstand 
extended periods of food shortage. Ketone bodies 
(β-hydroxybutyrate and acetoacetate), produced by 
the liver and gut epithelial cells (68) during 
β-oxidation of fatty acids to acetyl-coenzyme 
A (CoA) or conversion of ketogenic amino acids 
or both, are released into the bloodstream to 
provide metabolic fuel to various organs (Fig. 4). 

Several short-term human clinical trials have 
shown that alternate-day fasting can deliver ben- 
efits similar to CR in terms of weight loss and 
cardiometabolic health, including reduction in 
body weight and improved lipid profiles, lower 
blood pressure, and increased insulin sensitivity 
(69-71). In cancer patients, fasting selectively 
protects normal cells, but not cancerous cells, 
against toxicity related to chemotherapeutic 
agents, and fasting for up to 5 days followed by 
a normal diet appears to be a safe, feasible, and 
effective strategy in reducing common side effects 
associated with chemotherapy (72). Nevertheless, 
the challenges of implementing alternate-day 
fasting or similar interventions are real and can 
be a major burden by causing difficulties rem- 
iniscent of those encountered by individuals on 
chronic CR. 


Fasting-mimicking diets 

Implementing a long-term calorie restriction 
or complete fasting can be challenging in hu- 
mans, and compliance tends to decrease with 
time. In a human study, the withdrawal rates of 
CR and periodic fasting (PF) participants from a 
12-month study were about 40 and 30%, respec- 
tively (73). In the 2-year CALERIE clinical trial, 
the compliance rate was reduced to 82% with 
the restriction achieved averaging -12%, which 
was half the targeted reduction (28). The suc- 
cess of the long-term CRONies study in which 
individuals of the CR group ate nearly half of 
the calories compared to the AL subjects (1112 to 
1958 kcal/day compared with 1976 to 3537 kcal/day) 
relied upon the strong motivation of its partic- 
ipants (31). There are side effects associated with 
prolonged periods of daily fasting (>15 hours) for 
human health. Studies associating such extended 
fasting periods and skipping breakfast with mor- 
tality and disease in humans have been reported 
(74, 75), although most long-lived populations 
from around the globe do practice 12- to 13-hour 
TRF with little to no adverse effects (5). 

To make fasting acceptable to most people, Longo 
and colleagues conceived a low-carbohydrate, 
high-fat diet that enhances compliance by avoid- 
ing complete deprivation of food. The diet coined 
as FMD (Fig. 1) has low calories and provides for 
plant-based soups, herbal tea, energy bars, nut- 
based snacks, and supplements to be gradually 
implemented in a 5-day cycle each month for 
3 months (76). For the first day, the caloric amount 
is about 1090 kcal (10% protein, 56% fat, and 34% 
carbohydrate) and, for days 2 to 5, only 725 kcal 
are provided (9% protein, 44% fat, and 47% car- 
bohydrate) (76). A similar diet was also designed 
for laboratory mice, whereby animals were al- 
lowed to consume 50% of the AL food supplied 
as a vegetable-based powder mixed with hydro- 
gel on day 1 and reduced to 10% of AL on days 2 
through 4. The main goal of the FMD is to 
maintain low circulating concentrations of IGF-1, 
insulin, and glucose, while increasing plasma 
concentrations of IGF-binding protein 1 and 
ketone bodies. Rejuvenating effects of FMD 
were initially reported through an increased 
number of progenitor stem cells (77). The FMD 
regimen also leads to improvement of markers 
of diseases and metabolic dysfunction, lowers 
cancer incidence, and extends health span in 
rodents, but without affecting maximum lon- 
gevity (76, 78). More recently, FMD has been 
proposed to exert a therapeutic, antidiabetic 
effect by fostering regeneration of pancreatic 
β cells and restoring insulin secretion and glucose 
homeostasis in mice through the regulation of 
protein kinase A (PKA) and mTOR pathways 
(77). Furthermore, this dietary intervention im- 
proves the control of autoimmune disease—for 
example, multiple sclerosis—through regulation 
of the immune system (79). Despite the positive 
results on health span, FMD did not increase 
maximum longevity in mice, and, when admin- 
istered to very old animals, it may have been 
detrimental (76). 
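
The caloric composition reported above for the human FMD can be worked through as a short calculation. The sketch below converts the stated energy shares into approximate gram amounts and sums the energy of one 5-day cycle. The conversion factors (4, 9, and 4 kcal/g for protein, fat, and carbohydrate) are the general Atwater factors, an assumption that is not taken from the cited study, and the names `FMD_DAYS`, `macro_grams`, and `cycle_kcal` are illustrative only.

```python
# Approximate energy density of each macronutrient in kcal per gram.
# These are the general Atwater factors, assumed here for illustration.
KCAL_PER_GRAM = {"protein": 4.0, "fat": 9.0, "carbohydrate": 4.0}

# Daily energy (kcal) and macronutrient energy shares as stated in the text (76).
FMD_DAYS = {
    "day 1": (1090, {"protein": 0.10, "fat": 0.56, "carbohydrate": 0.34}),
    "days 2-5": (725, {"protein": 0.09, "fat": 0.44, "carbohydrate": 0.47}),
}


def macro_grams(total_kcal, shares):
    """Convert energy shares of a daily total into approximate grams."""
    return {
        macro: total_kcal * share / KCAL_PER_GRAM[macro]
        for macro, share in shares.items()
    }


def cycle_kcal():
    """Total energy of one 5-day cycle: one day-1 ration plus four later days."""
    return FMD_DAYS["day 1"][0] + 4 * FMD_DAYS["days 2-5"][0]


if __name__ == "__main__":
    for label, (kcal, shares) in FMD_DAYS.items():
        grams = macro_grams(kcal, shares)
        summary = ", ".join(f"{m}: {g:.1f} g" for m, g in grams.items())
        print(f"{label} ({kcal} kcal): {summary}")
    print(f"one 5-day cycle: {cycle_kcal()} kcal")
```

Under these assumptions, one cycle supplies 3990 kcal in total (1090 kcal on day 1 plus 4 × 725 kcal). The gram amounts are only estimates, because the actual energy density of the foods provided may differ from the general factors.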


Conclusions 


Pharmacological interventions with proven ef- 
fectiveness are often accompanied by a range 
of unwanted side effects. Many elderly people 
take multiple medications, which can cause ad- 
verse geriatric outcomes linked to increases in 
morbidity and mortality (80). Dietary interventions 
that are accompanied by long fasting periods 
have emerged as promising strategies to target 
a myriad of clinical parameters that constitute 
the foundation for metabolic syndrome, car- 
diovascular disease, cancer, and even neuro- 
degenerative diseases (2). Although the specific 
mechanisms are far from being fully understood, 
this periodic absence of energy intake appears to 
improve multiple risk factors and, in some cases, 
reverse disease progression in mice and humans. 
Thus, the time is ripe to deepen our understanding 
of the molecular mechanisms of action and 
efficacy of these dietary interventions as a 
foundation for future clinical trials. It will be 
important to understand and consider the dif- 
ferences in metabolic rate, equivalence of fast- 
ing times, role of diet composition, and the 
duration of the intervention as they relate to a 
particular disease or clinical target. Although 
current mouse models have not been very use- 
ful in identifying translatable therapies for some 
of the most common chronic diseases (e.g., Alz- 
heimer’s and cardiovascular disease), it should 
be acknowledged that most experiments have 
been limited to a few inbred strains. It may be 
important to move to other model organisms 
closer to human physiology and etiology—such 
as pigs, dogs, and nonhuman primates—to test, 
develop, and translate interventions. Strategies 
could use different forms of CR, FMD, PF, and 
TRF in combination with known pharmacological 
interventions and CR mimetics. We expect that 
these dietary interventions combined with classi- 
cal pharmacology and clinical practice will yield 
interventions that will improve human health 
and enhance health span and quality of life as we 
grow old. Although promising, these approaches 
are still experimental in nature and should not 
be initiated without medical supervision. 

The gut microbiota at the intersection 
of diet and human health 


Christopher L. Gentile and Tiffany L. Weir* 


Diet affects multiple facets of human health and is inextricably linked to chronic metabolic 
conditions such as obesity, type 2 diabetes, and cardiovascular disease. Dietary nutrients 
are essential not only for human health but also for the health and survival of the trillions 
of microbes that reside within the human intestines. Diet is a key component of the 
relationship between humans and their microbial residents; gut microbes use ingested 
nutrients for fundamental biological processes, and the metabolic outputs of those 
processes may have important impacts on human physiology. Studies in humans and 
animal models are beginning to unravel the underpinnings of this relationship, and 
increasing evidence suggests that it may underlie some of the broader effects of diet 
on human health and disease. 


Controversy regarding what constitutes a 
healthful diet has persisted since the ad- 
vent of nutrition as a scientific discipline 
and establishment of government nutritional 
guidelines (1). The emergence of the 
gut microbiota as a key regulator of health and 
disease has further complicated this issue. A 
mutualistic relation exists between diet and the 
gut microbiota so that dietary factors are among 
the most potent modulators of microbiota com- 
position and function. Intestinal microbes in 
turn influence the absorption, metabolism, and 
storage of ingested nutrients, with potentially 
profound effects on host physiology. 

The human gut microbiota consists of trillions 
of microbial cells and thousands of bacterial 
species. The specific compositional features differ 
among individuals, and although the mature 
microbiota is fairly resilient, it can be altered 
within individuals by both internal and exter- 
nal stimuli. Interindividual variability and the 
plasticity of the gut microbiota have hindered 
efforts to identify a “healthy” microbiota, al- 
though markers of microbial stability, such as 
richness and diversity, are often used as indi- 
cators of gut health because of their inverse 
association with chronic disease and metabolic 
dysfunction (2). Microbiota plasticity also creates 
a distinct opportunity; by manipulating various 
external factors, the potential exists to reshape 
the architecture and biological outputs of gut 
microbes for improved human health. 
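
The richness and diversity markers mentioned above can be illustrated with a small calculation. The sketch below computes observed richness (the number of taxa detected) and the Shannon index from a table of taxon counts. The Shannon index is one commonly used diversity measure and is assumed here for illustration, because the text does not specify which index the cited studies used. The function names and the example counts are hypothetical.

```python
import math


def richness(counts):
    """Observed richness: number of taxa with at least one count."""
    return sum(1 for c in counts.values() if c > 0)


def shannon_diversity(counts):
    """Shannon index H = -sum(p_i * ln p_i) over taxa with nonzero counts."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    h = 0.0
    for c in counts.values():
        if c > 0:
            p = c / total
            h -= p * math.log(p)
    return h


if __name__ == "__main__":
    # Hypothetical read counts per taxon for two samples (not real data).
    even_sample = {"taxon_a": 25, "taxon_b": 25, "taxon_c": 25, "taxon_d": 25}
    skewed_sample = {"taxon_a": 97, "taxon_b": 1, "taxon_c": 1, "taxon_d": 1}
    for name, sample in (("even", even_sample), ("skewed", skewed_sample)):
        print(name, richness(sample), round(shannon_diversity(sample), 3))
```

Both hypothetical samples have the same richness, but the evenly distributed sample has the higher Shannon index, which is why richness and diversity are usually reported together.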

Diet is an important external factor affecting 
the gut microbiota, and diet’s ability to alter mi- 
crobial ecology was first recognized more than a 
century ago (3). Transient diet-induced altera- 
tions occur independently of body weight and 
adiposity and are detectable in humans within 
24 to 48 hours after dietary manipulation (4). 
The effects of diet on microbial ecology are un- 
surprising when one considers that gut microbes, 
like their human hosts, use ingested nutrients as 
fuel for fundamental biological processes. Thus, 
changes to host dietary patterns alter bacterial 
metabolism and favor species most suited to use 
consumed fuel sources. What was not predicted 
after the initial observations a century ago, and 
has only come to light in recent decades, is the 
important effect that diet-induced changes in 
microbial structure have on human physiology 
and disease processes. 


Nutrients 
Microbiota-accessible carbohydrates 


When studied in isolation, each of the major 
macronutrients and numerous micronutrients 
have been shown to modify the gut microbiome. 
Among the macronutrients, the effects of die- 
tary carbohydrates (CHO) are best characterized. 
Simple CHO such as sucrose, both alone and as 
part of a Western-style high-fat high-sugar diet, 
cause rapid microbiota remodeling and meta- 
bolic dysfunction in experimental animals (5, 6). 
Complex CHO exhibit a diverse array of mono- 
saccharide linkages, many of which are indi- 
gestible by humans. Gut microbes, on the other 
hand, possess several hundredfold more CHO- 
degrading enzymes and thus use indigestible 
CHO as their primary energy source. The term 
“fiber” is commonly used to describe these in- 
digestible CHO, although this designation is 
problematic given that some fibers are not used 
by gut microbes (such as cellulose), whereas 
other readily fermented CHO fall outside of the 
definition of fiber (such as resistant starches). 
Sonnenburg and colleagues (7) recently proposed 
the term “microbiota-accessible carbohydrate,” 
or MAC, to describe CHO that are metabolically 
available to gut microbes, and we use that ter- 
minology hereafter. 

Several lines of evidence indicate that alter- 
ations in dietary MACs have important effects 
on microbiota composition and function. Agro- 
nomic and nomadic hunter-gatherer societies 
that consume high levels of MACs display greater 
microbial richness and diversity as compared 
with those of industrialized societies (8, 9). Diets 
high in MACs alter microbiota composition in 
humans within days or weeks (10, 11). Mice fed 
a diet low in MACs experience decreases in 
numerous taxa, and loss of diversity is com- 
pounded over several generations of offspring 
and not recovered after reintroduction of MACs 
(7, 9). Reductions in bacterial abundance with 
low MAC intake are not observed uniformly 
across all bacterial taxa because certain bac- 
terial species that typically consume dietary 
glycoproteins can also use glycoproteins of the 
intestinal mucus layer as an alternative energy 
source. Over-grazing of the mucus layer by these 
species may be an important consequence of 
MAC restriction, as chronic foraging has been 
shown to compromise barrier integrity and enhance 
inflammation and pathogen susceptibility 
in animal studies (12, 13). 

Another consequence of MAC restriction is a 
reduction in short chain fatty acid (SCFA) pro- 
duction. SCFAs, the primary end products of 
bacterial fermentation, represent an excellent 
example of mutualism between humans and 
their bacterial symbionts. MACs provide a crit- 
ical energy source for gut bacteria, and the con- 
sequent production of SCFAs benefits the host by 
serving as both recovered energy from otherwise 
inaccessible carbohydrates as well as potent regu- 
latory molecules with vast physiological effects 
(Fig. 1). SCFAs signal via the central nervous sys- 
tem and several G protein-coupled receptors 
(GPCRs) to modulate a range of physiological 
processes, including energy homeostasis, lipid 
and carbohydrate metabolism, and suppression 
of inflammatory signals (14, 15). Two SCFAs, 
butyrate and propionate, also act as histone de- 
acetylase inhibitors, suggesting that they can 
epigenetically influence host gene expression 
(16). Thus, decreased SCFA production and in- 
creased mucus foraging represent two microbiota- 
dependent consequences of low MAC intake, and 
it is tempting to speculate that these processes 
underlie the long-recognized health benefits of 
high-fiber diets. It should be noted that the ex- 
tent of mucus foraging in humans, and its impor- 
tance in human disease processes, have not been 
directly examined. Furthermore, despite the pro- 
tective effects of SCFAs observed in preclinical 
models, obese humans and genetically obese 
mice display increased fecal and caecal concen- 
trations of SCFA (17), suggesting that they may 
contribute to enhanced energy harvest. Thus, 
energy balance status of an individual may de- 
termine whether the beneficial effects of SCFA 
signaling on metabolism outweigh the additional 
calories harvested (18). 

It is important to recognize that MACs repre- 
sent a diverse group of oligo- and polysaccharides 
with considerable structural heterogeneity and 
diverse effects on microbial ecology. A specific 
subset of MACs has been termed “prebiotic,” 
which originally described a class of oligosac- 
charides that selectively enhance growth of 
Bifidobacterium and Lactobacillus (19). These 
canonical prebiotics, primarily fructo- and galac- 
tooligosaccharides of varying chain lengths, have 
been shown to alter members of the human gut 
microbiota and modulate inflammation and 
markers of metabolic syndrome (20, 21). Despite 
promising data on oligosaccharide use for appe- 
tite regulation and obesity-related complications, 
therapeutic efficacy of these prebiotics in treat- 
ing gastrointestinal conditions is variable (22). 
Several studies have shown that restricting oli- 
gosaccharides and other fermentable sugars [a 
low-FODMAPs (fermentable oligosaccharides, 
disaccharides, monosaccharides, and polyols) 
diet] alleviates symptoms of irritable bowel syn- 
drome (23). 

Technological advances that allow for holistic examination of microbial 
responses to dietary components have led to a recent expansion of the 
prebiotic concept (24). Although selective use of a dietary substrate by 
specific microbial populations is still required, the list of potential 
substrates and microbial targets is more inclusive. For example, candidate 
prebiotic substrates now include nonpolysaccharide dietary components such 
as polyunsaturated fats, conjugated linoleic acid, and 
phytochemicals/phenolics (24, 25). Likewise, prebiotics may be selectively 
used by bacteria other than Bifidobacterium and Lactobacillus, provided the 
net effects on host health are beneficial. As a result of these expanded 
inclusion criteria, studies that characterize distinct 
host-microbe-substrate interactions are of particular interest. 


Dietary fats 


An increase in dietary fat also substantially alters microbiota 
composition. Experimental mice fed a high-fat diet (40 to 80% total caloric 
intake) exhibit decreases at the phylum level in Bacteroidetes and 
increases in Firmicutes and Proteobacteria. These changes were observed in 
mice resistant to weight gain, implying a direct effect of dietary lipids 
on the microbiota (26, 27). Germ-free (GF) mice are protected from the 
metabolic consequences of high-fat diets, suggesting that gut microbes may 
be important mediators of lipid-induced metabolic dysfunction (28). The 
metabolic protection in GF mice may be due to enhanced fat oxidation or 
reduced absorption in the small intestine (29). Microbes in the small 
intestine were recently found to be highly susceptible to fat load and 
essential for lipid digestion and absorption (17). These data suggest that 
regional specificity of microbiota composition may have important 
functional consequences and highlight the need for spatially distinct 
analyses along the gastrointestinal tract. Importantly, not all studies 
have found that GF mice are protected from the metabolic consequences of 
high-fat feeding, and the cholesterol content of the diet may be an 
important determining factor (30). As with carbohydrates, lipid-mediated 
effects on the microbiota are dependent on the lipid type and source. For 
example, mice fed an isocaloric diet rich in long-chain saturated fats 
derived primarily from meat products displayed greater insulin resistance 
and adipose tissue inflammation as compared with that of mice fed a 
high-fish oil diet. These metabolic disturbances were accompanied by 
reductions in phylogenetic diversity in the saturated fat-fed mice, and 
receipt of transplanted microbiota from mice fed fish oil abrogated 
saturated fat-induced inflammation (31). Furthermore, transgenic mice that 
constitutively produce ω-3 polyunsaturated fatty acids possess a microbiome 
with enhanced phylogenetic diversity that offers protection against the 
metabolic consequences of a high-saturated-fat, high-sugar diet (32). 


Fig. 1. MAC fermentors produce SCFAs that can have multiple 
interactions with host tissues. Butyrate is taken up by epithelial cells 
and used as a primary source of energy for these cells. Butyrate (and to 
a lesser degree, propionate) can block histone deacetylases (HDAC) 
to regulate gene expression. All of the SCFAs can bind with varying affinities 
to G protein receptors in the intestines and other cells to regulate energy 
metabolism, intestinal homeostasis, and immune responses. Acetate and 
propionate are primarily metabolized in the liver, where propionate is 
used as a substrate for gluconeogenesis and acetate is used as an energy 
source and for fatty acid synthesis. 

One mechanism by which gut microbes may 
mediate the metabolic consequences of high-fat 
intake is through translocation of lipopolysac- 
charide (LPS), a cell-wall component of gram- 
negative bacteria. Increases in circulating LPS 
have been reported in humans after a high-fat 
meal, with exacerbated effects in obese indi- 
viduals (33). Once in circulation, LPS elicits a 
potent inflammatory response via Toll-like 
receptor 4 signaling, which has been implicated 
in the development of cardiovascular and me- 
tabolic disease (34). Although existing data that 
link circulating LPS to cardiometabolic distur- 
bances are compelling, progress in this area has 
been hindered by the inability of available assays 
to distinguish between stimulatory and non- 
stimulatory LPS, as well as by circulating inhibi- 
tors that reduce accuracy of LPS quantification 
(35). Furthermore, although circulating LPS has 
been reported in obese individuals and correlates 
with markers of metabolic disease, a direct 
causal role in human disease has not been exam- 
ined (36). 

Primary bile acids are produced in the liver 
from cholesterol and facilitate the digestion of 
dietary lipids. Once generated, primary bile acids 
are secreted into the small intestine, where they 
facilitate the solubilization and absorption of 
lipids. Microbial alterations to primary bile acids 
include hydrolysis of conjugated amino acids, 
7α/β-dehydroxylation, and oxidation and epimerization 
of hydroxyl groups at various positions 
(Fig. 2) (37). GF mice display in- 
creased abundance and reduced di- 
versity of bile acids compared with 
that of conventional mice (38), and 
enrichment of specific secondary 
bile acids has been observed in colo- 
rectal cancer cases (39). In addi- 
tion to their canonical role in aiding 
lipid digestion, bile acids act as dy- 
namic signaling molecules via the 
farnesoid X receptor (FXR) and the 
G protein-coupled bile acid receptor 1 
(TGR5). Like SCFAs, bile acids 
have been shown to regulate energy 
homeostasis, glucose metabolism, 
and innate immunity (40). More re- 
cent data suggest that gut microbes 
also have direct effects on FXR and 
TGR5 expression and signaling 
(40, 41). Thus, the gut microbiota 
helps regulate bile acid composi- 
tion, abundance, and signaling, and 
this regulation may have important 
implications not only for lipid di- 
gestion and absorption but also for 
the development and prevention of 
metabolic disease. Several bile acid- 
directed therapeutics are currently 
being examined for obesity-related 
conditions, and as the clinical utility 
of these therapeutics is tested, it will 
be important to consider microbiota 
composition in determining interindividual ef- 
ficacy and safety. 




Dietary protein 


Dietary proteins also modulate microbial com- 
position and metabolite production, with amino 
acids providing gut microbes essential carbon 
and nitrogen. Amino acid catabolism yields nu- 
merous metabolites that affect host physiology 
(Fig. 3). For example, although SCFAs are derived 
mainly from MAC fermentation, they are also by- 
products of bacterial metabolism of amino acids. 
The relative contribution of amino acid metab- 
olism to total SCFA production is unclear, but 
total protein and fiber intake are influencing 
factors. Additional metabolites of amino acid 
catabolism include branched chain fatty acids, 
indoles, phenols, ammonia, and amines, all of 
which can affect human health. For example, 
phenols, indoles, and amines can combine with 
nitric oxide to form genotoxic N-nitroso com- 
pounds that are associated with gastrointestinal 
cancers in human populations (42). By contrast, 
indolepropionic acid, a microbial metabolite of 
tryptophan, maintains intestinal homeostasis and 
protects from experimental colitis (43). Indole-3- 
acetate, another bacterially derived tryptophan 
metabolite, was recently shown to reduce hepa- 
tocyte and macrophage inflammation (44). The 
source of dietary protein also determines the 
nature of microbiota-dependent metabolic out- 
puts. This is perhaps best exemplified by pro- 
duction of the compound trimethylamine oxide 
(TMAO) from the amino acid L-carnitine, which 
is abundant in animal but not vegetable protein. 
TMAO is predictive of cardiovascular 
events in various populations (45) and 
has been implicated in the develop- 
ment of fatty liver disease (46). A re- 
cently developed inhibitor of TMAO 
production reduced platelet aggrega- 
tion and thrombus formation in ex- 
perimental animals, enhancing the 
potential of TMAO-directed therapies 
(47). Collectively, these data highlight 
that the vast effects of microbiota- 
derived amino acid metabolites on host 
physiology are only now beginning to 
emerge and represent an area ripe for 
future research. 


Micronutrients 


In addition to major macronutrients, the gut microbiota regulates both 
synthesis and metabolic output of various micronutrients. The B vitamins, 
for example, can be synthesized by more than 100 bacterial species, and 
analysis of the synthesis pathways involved suggests that bacteria 
cooperatively exchange B vitamins to ensure survival (48). The relation 
between vitamins and the microbiota appears to be bidirectional because 
several vitamins supplied by the host shape microbial composition and 
provide critical functions within bacteria. Riboflavin, for example, 
regulates bacterial extracellular electron transfer and redox status (49), 
and vitamin D and its receptor help regulate intestinal inflammation, in 
part by shaping microbial ecology (50). Like vitamins, metals are required 
cofactors for numerous mammalian and bacterial physiological processes and 
can dramatically alter the microbiota. Zinc deficiency, which is a strong 
risk factor for potentially fatal childhood diarrhea in developing 
countries, enhances populations of pathogenic bacteria (51). Iron is an 
essential micronutrient for pathogen growth, and restricting iron intake is 
an effective form of nutritional immunity against pathogen establishment. 
Human breastmilk transmits lactoferrin, an iron-binding glycoprotein, to 
protect the undeveloped infant gut from pathogen colonization, and iron 
supplementation in infants can increase pathogen growth and intestinal 
inflammation (52). Despite observed bacteriogenic effects of iron, 
supplementation in experimental mice was recently found to suppress 
virulence of the rodent enteric pathogen Citrobacter rodentium, essentially 
converting the pathogen to a commensal microbe (53). High salt intake has 
been implicated in the cardiovascular consequences of Western diets. Recent 
data suggest that the hypertensive effects of high-salt diets in 
experimental animals and humans are mediated by reduced levels of 
Lactobacillus and subsequent increases in proinflammatory T helper 17 cells 
(54). Collectively, the interactions identified thus far between the 
microbiota and micronutrients, as well as the myriad other interactions 
that undoubtedly await discovery, represent an important avenue of future 
research. The data also highlight the importance of monitoring 
micronutrient composition in microbiota-focused dietary intervention 
studies and beget the need for clinical trials in populations at risk for 
vitamin and mineral deficiencies. 


Fig. 2. Gut bacteria play an important role in bile acid modification. 
Primary bile acids deposited into the small intestine are 
deconjugated and dehydroxylated by enzymes from bile-modifying 
bacteria. These changes influence total bile acid pools available 
for reabsorption and recycling through enterohepatic circulation. 
In addition, bile acids can act as regulatory molecules by binding to 
cell surface or nuclear receptors, influencing host factors such as 
energy expenditure and lipid metabolism. 


Food additives 


The impact of food additives on the gut microbiota and intestinal 
homeostasis represents another understudied area with potential 
implications for human health. Although microbial and health consequences 
of Western diets are typically attributed to macronutrient composition, 
several studies suggest that the detrimental effects may be driven by food 
additives. For example, administration of two dietary emulsifiers, 
polysorbate-80 and carboxymethyl cellulose, induced obesity, intestinal 
inflammation, and metabolic dysfunction in the absence of other dietary 
manipulations in mice. The microbiota was both required and sufficient for 
these effects as GF mice were protected from detrimental consequences, and 
microbiota transfer from emulsifier-treated mice was sufficient to 
recapitulate the metabolic derangements (55). These results are 
particularly striking given the wide range of foods in which these 
emulsifiers are found (such as gluten-free and reduced-fat products, ice 
cream, wine, and pickles), and that the doses used in the study are well 
within the amount consumed by humans. In addition to emulsifiers, 
non-nutritive sweeteners (NNS) have been linked to gut-associated metabolic 
alterations. In a series of experiments that used rodents and humans, NNS 
consumption induced glucose intolerance in a microbiota-dependent manner 
(56). However, the collective data regarding the effects of NNS on the gut 
microbiota and metabolic function are equivocal, and it is important to 
recognize that NNS represent a broad class of substances with tremendous 
structural and functional diversity. Thus, the physiological effects of a 
particular NNS should not be generalized, and additional human intervention 
studies examining the impact of individual NNS are needed. 


Dietary patterns 


One obvious limitation to studying the 
health effects of individual nutrients 
is that those nutrients are rarely con- 
sumed in isolation; thus, experimental 
manipulation of an individual macro- 
nutrient invariably alters intake of other 
macronutrients that may have meta- 
bolic effects unto themselves. For exam- 
ple, high-fat diets are commonly low in 
fiber, and it may be this latter feature, 
and its downstream effects on the micro- 
biota, that drives some of the metabolic 
consequences of the diet rather than 
the elevated fat content by itself. Given the limi- 
tations of studying nutrients in isolation, there is 
increasing focus away from this experimental 
reductionism and toward examining the health 
effects of broader dietary patterns. 


Ketogenic diet 


The ketogenic diet is characterized by very low 
CHO consumption (5 to 10% of total caloric 
intake), sufficient to enhance ketone produc- 
tion. It was originally developed as a treatment 
for refractory childhood epilepsy, and response 
of the gut microbiota to a ketogenic diet appears 
to play a role in the efficacy of this intervention in 
epileptic children (57). Animal data suggest that 
the neuroprotective effects are mediated through 
modulation of specific gut bacteria that enhance 
hippocampal γ-aminobutyric acid/glutamate levels 
(58). In recent years, the benefits of ketogenic diets 
have extended beyond seizure control, and the 
diets are commonly adopted for weight loss and 
have been shown to enhance longevity and re- 
duce disease onset in experimental animals (59). 
Conversely, some human studies that examined 
ketogenic diets suggest negative impacts on mi- 
crobial ecology and gut health. However, these 
studies were conducted in small cohorts with 
specific metabolic conditions (60, 61), limiting 
generalization to larger populations. Because 
modified versions of ketogenic diets are rapidly 
growing in popularity, it is necessary to examine 
their long-term safety and impacts on the 
gut microbiota and intestinal environment. 


Paleolithic diet 


The Paleolithic diet, which seeks to mimic 


the dietary patterns of pre-agricultural so- oie 


cieties, is often implemented as a high- 


protein/low-CHO diet for weight loss by ee) 


TM) 


individuals in Western societies. Clinically, 
the Paleolithic diet is being explored for 
management of inflammatory bowel dis- 
ease (IBD), and although initial findings 
were promising, the study was conducted 
in a small cohort, and additional nutrient 
supplementation was required to curtail iron 
and vitamin D deficiencies (62). Although 
there is a paucity of intervention studies 
that examine microbiota-specific effects of 
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modest microbiota differences between omni- 
vores and vegetarians and suggest that the effects 
of dietary pattern on the microbiota are greatest 
at the genus and species level and relatively min- 
imal on broader compositional features, such as 
diversity and richness (65-67). Despite the ab- 
sence of global microbiota compositional shifts, 
the species-level changes appear sufficient to 
alter metabolic outputs because SCFA produc- 
tion is typically increased in vegetarians. The 
extent to which these microbial metabolic out- 
puts mediate the beneficial effects of vegetarian 
diets is unclear. 
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Mediterranean diet 

The Mediterranean diet emphasizes consump- 
tion of a variety of foods (fruits, vegetables, 
legumes, unsaturated fats, and limited red meat 
intake) rather than the exclusion of particular 
food groups or confinement to specific macro- 
nutrient ratios. Numerous epidemiologic studies 
and clinical trials have demonstrated that fol- 
lowing a Mediterranean diet reduces the risk of 
all-cause mortality and multiple chronic diseases 
(64, 70). Although only a few of these studies 
have examined the effects on microbiota compo- 
sition, existing data indicate that the Mediterra- 
nean diet elicits favorable microbiota profiles 
and metabolite production, with microbial 
diversity paralleling levels of dietary adher- 
ence (7/-73). For example, closer adherence 
to the Mediterranean dietary pattern was 
associated with lower ratios of Firmicutes: 
Bacteroidetes and higher fecal SCFA detec- 
tion (73). Collectively, these studies suggest 
that emphasis should be placed on suffi- 
cient inclusion of a variety of plant-based 
foods rather than exclusion of animal-based 
food and supports the concept that diet di- 
versity is a driver of microbiota stability. 


Microbiota-targeted diets 


Anumber of microbiota-targeted diets have 
recently emerged with the growing public 
awareness of the gut microbiota and its 
potential to influence human health. Al- 


replacing a Western diet with a Paleolithic 
diet, comparative studies between industri- 
alized populations and modern-day hunter- 
gatherer societies have provided some insight. 
The Hadza, a hunter-gatherer tribe whose 
lifestyle has been described as mimicking 
that of Paleolithic communities, experience 
few metabolic diseases that plague indus- 
trialized societies, and their microbiota is 
characterized by greater microbial diversity 
(9, 63). However, it is difficult to ascribe 
these microbiota and health benefits to 
lower CHO intake by itself because the 
Hadza diet is rich in plant-derived MACs, 
and their microbiota contains a high abun- 
dance of CHO-metabolizing bacteria (63). 
Furthermore, diets of Western societies lack many 
of the traditional foods and seasonal variation 
of the Hadza, highlighting the limited parallels 
between traditional hunter-gatherer diets and 
modern-day Paleolithic diets adopted by individ- 
uals in Western societies. 


Vegan/vegetarian diets 


Plant-rich diets have long been a key feature of 
dietary recommendations, and vegan/vegetarian 
diets are associated with positive health outcomes 
and reduced disease risk (64). These beneficial 
effects may extend to the gut microbiota. Plant- 
based foods constitute the primary source of 
dietary MACs, and the microbiota of individu- 
als who consume vegetarian or predominantly 
plant-based diets exhibit greater capacity for 
MAC fermentation. However, some intervention 
and cross-sectional studies have observed only 
Fig. 3. Interactions between amino acids and the gut
microbiota. Microbial metabolism of the amino acid carnitine 
produces trimethylamine (TMA), which is subsequently 
oxidized in the liver to TMAO in a reaction catalyzed by flavin- 
containing monooxygenase (FMO). Increased levels of 
circulating TMAO have been linked to metabolic disease. 
Gut microbes metabolize the amino acid tryptophan into various 
substances, including indolepropionic acid (IPA) and indole-3- 
acetic acid (I3A), both of which can enter the general circulation. 
The metabolic effects of IPA, I3A, and other microbially derived 
amino acid metabolites are only now beginning to emerge. 


In addition to providing MACs, plant-based 
foods provide a diverse source of phytochemicals, 
biologically active small molecules with the po- 
tential to affect human health. Within the plant, 
many phytochemicals are glycosylated, reducing 
their bioavailability and bioactivity when con- 
sumed. As a result, phytochemicals often reach 
the lower intestinal tract and can have direct anti- 
microbial and anti-inflammatory effects in the gut. 
In addition, phytochemicals can be modified by 
microbial enzymes into metabolites with increased 
bioavailability and altered bioactivity (68). A prom- 
inent example includes the bioconversion of natu- 
rally occurring soy isoflavones to equol (69), which 
exhibits increased bioavailability compared with 
that of the parent compounds. Thus, microbe- 
mediated alterations in phytochemical bioavail- 
ability may represent an additional mechanism 
underlying the beneficial effects of plant-based diets. 
though the scientific premise of many of
these diets is logically rooted in prevail- 
ing paradigms, they fail to acknowledge 
the many gaps in our knowledge regard- 
ing diet-microbiome-host interactions. The 
specific carbohydrate diet (SCD) restricts 
complex carbohydrates and refined sugar 
under the premise that these compounds 
are malabsorbed in the gastrointestinal 
tract, leading to bacterial overgrowth and 
gut dysbiosis. As with the low-FODMAPs diet,
there is some evidence to suggest that individuals
with IBD experience benefits
when following an SCD (74), although long-term
adherence to this restrictive diet is
difficult. A derivative of the SCD is the gut 
and psychology syndrome (GAPS) diet, which 
expands beyond dietary exclusion of specific foods 
by adding foods claimed to have gut-healing po- 
tential. This includes consumption of nutrient-rich 
bone broth to regenerate the intestinal lining and 
antimicrobial foods such as garlic to reduce path- 
ogen loads. Although this diet is frequently 
prescribed by alternative medicine practitioners 
for conditions ranging from autism to depression,
there is only anecdotal rather than scientific
evidence to support that adherence produces
the intended effects on health outcomes. A study 
supporting the use of a similar diet was recently 
retracted from the journal PLOS ONE because of 
several issues, including the absence of a con- 
trol group, inadequate description of methods, and 
lack of microbiota analysis (75). Last, although 
not a diet in itself, there is a common belief that 
probiotic-containing fermented foods modify the 
gut microbiota and confer host health benefits.
However, fermented foods rarely contain ade- 
quate amounts of specific probiotic organisms. 
Moreover, there is limited evidence for the role 
of probiotics as modulators of the human gut 
microbiota, and recent data suggest that even 
supplemental quantities of probiotics exert lim- 
ited effects on human gut ecology (76) and may 
even be detrimental with regard to recoloniza- 
tion of the microbiota after antibiotic use (77). 
With the exception of yogurt, benefits of fer- 
mented foods have not been clinically tested, and 
studies examining the effects of yogurt for weight 
management are equivocal (78). Evidence sug- 
gests that acetic acid, a metabolic by-product of 
fermentation, suppresses appetite and reduces 
postprandial insulin response and glucose ex- 
cursions (79). In addition, some fermented foods 
have higher nutrient content and elevated B vita- 
mins as compared with those of nonfermented 
forms of the same food (80), suggesting that any 
added benefits may be due to microbial metab- 
olites rather than the microbes themselves. 


Perspectives and future directions 


Data collected over the past decade have iden- 
tified the gut microbiota as an important factor 
defining interindividual variation in disease risk 
and dietary response. The ascension of the gut 
microbiota as a key regulator of human physiol- 
ogy has generated tremendous excitement within 
and beyond the scientific community, as exem- 
plified by the exponential increase in microbiota- 
focused publications and by the growth of the 
probiotic market into a multibillion-dollar indus- 
try. The rapidity of this ascension, however, poses 
a substantial challenge in that commercialization 
and popularity of microbiota-targeted therapies 
have accelerated despite the fact that fundamen- 
tal questions regarding the microbiota and its re- 
lation to diet and human health remain unresolved. 
For example, much of the current data linking the 
microbiota to disease processes have been gener- 
ated in animal models, and human feeding studies 
are needed to confirm their relevance before they 
can be translated to practical nutrition advice. 
The plasticity of the microbiota makes 
microbiome-targeted interventions an attract- 
ive approach for disease prevention and treat- 
ment. However, despite reported alterations of 
the gut microbiota in response to short-term die- 
tary interventions, long-term dietary patterns are 
associated with stable microbiota conformations 
that are difficult to alter (81-83). Additionally,
diet-responsive members of the microbiota often 
represent a small proportion of the total com- 
munity, necessitating a better understanding of 
whether changes in tractable populations are 
sufficient to elicit physiological outcomes in the 
host. Last, the degree of microbiota plasticity, 
and thus the potential for response to microbiota- 
directed interventions, may be dependent on 
baseline microbial populations (76) and past 
dietary patterns (82). For example, recent data 
from mice colonized by human bacteria indicate 
that diets associated with reduced microbial 
diversity may have impaired responsiveness to 
dietary interventions, requiring reexposure to
diet-responsive microorganisms (82). Thus, identifying
the presence of key diet-responsive microorganisms
may be important in predicting the success
of dietary interventions in humans.

Although plasticity offers an opportunity for 
microbiota-targeted therapeutics, it represents 
a double-edged sword in that it also catalyzes 
deleterious effects of external stressors, such as 
an unhealthful diet. Dysbiosis, an alteration in 
the composition of the microbiota associated 
with a disease state, has been linked with poor 
diet and numerous chronic diseases. However, 
determination of an absolute, rather than com- 
parative, definition of dysbiosis remains a funda- 
mental unresolved question. The recently proposed 
“Anna Karenina hypothesis” suggests that dys- 
biosis is not a definitive pattern but rather the loss 
of microbiota stability, resulting in dispersion 
from the norm (84). This indicates the need to 
define a healthy microbiota, which is problematic 
on an individual basis, given that factors such as 
environment, genetics, past diet, exercise, and ge- 
ography all play a role in shaping the microbiota, 
and these individual influences may or may not 
translate to tangible health outcomes. 

Despite current unresolved questions that 
limit practical and clinical translation of mi- 
crobiota research, several new developments 
promise continued advancement of the field. 
Integration of proteomic and metabolomic data 
with existing DNA-based methods of microbiota 
assessment could circumvent the need to define 
healthy versus unhealthy microbial populations 
as more accurate functional profiling emerges. 
To this end, the Integrated Human Microbiome 
Project was recently launched to functionally 
characterize human cohorts in various life stages 
and at the onset of specific disease states (85). 
Additionally, new bioinformatics tools and mod- 
eling algorithms are improving our ability to ap- 
ply microbiota data to human outcomes. Examples 
of recent successes include the development of 
algorithms that incorporate microbiota profiles 
to predict post-dieting weight gain (86) and de- 
velop personalized diet recommendations for 
control of postprandial glycemic responses (87). 
These examples demonstrate how realization of 
the potential for microbiota-diet interactions could 
change future approaches to nutrition. 

Swifter, higher, stronger:
What’s on the menu? 


Louise M. Burke* and John A. Hawley


The exploits of elite athletes delight, frustrate, and confound us as they strive to reach 
their physiological, psychological, and biomechanical limits. We dissect nutritional 
approaches to optimal performance, showcasing the contribution of modern sports 
science to gold medals and world titles. Despite an enduring belief in a single, superior 
“athletic diet,” diversity in sports nutrition practices among successful athletes arises 
from the specificity of the metabolic demands of different sports and the periodization 
of training and competition goals. Pragmatic implementation of nutrition strategies in 
real-world scenarios and the prioritization of important strategies when nutrition themes 
are in conflict add to this variation. Lastly, differences in athlete practices both promote 
and reflect areas of controversy and disagreement among sports nutrition experts. 


Ancient Olympians manipulated their diets
according to prevailing beliefs, with Pythag- 
oras being credited (probably incorrectly) 
for introducing athletes to meat and protein-rich
foods in place of traditional figs, cereals,
and cheese (1). Meanwhile, modern-day
athletes are bombarded with social media “war- 
riors” who evangelize vegan, Paleo, and low-carb 
“keto” diets for peak performance. In contrast to 
the battle over the perfect menu, contemporary 
sports nutrition embraces diversity in dietary 
practices, underpinning the demands of training 
and competition with the philosophies of spec- 
ificity, periodization, and personalization (2). 
The metabolic demands of elite sport are 
complex, with events lasting from seconds (jumps, 
throws, and lifts) to several weeks (Grand Tour 
cycling races). Performance outcomes culminate 
from deliberate, sport-specific training aimed at 
maximizing adaptations toward the fulfilment 
of individual genetic potential (3). Although some 
elite athletes benefit from systematic, science- 
driven advice on training adaptation and com- 
petition performance, others use trial-and-error 
approaches under the guidance of experienced 
coaches, leaving scientists to explain post hoc 
how diet might have contributed to their per- 
formance peaks (3). 


Solving the fuel crisis 


Energy for competitive sports is provided by 
transforming chemical energy (intramuscular 
glycogen and lipids) into mechanical energy 
(contraction), with adenosine triphosphate (ATP) 
being the metabolic intermediary (4). Because 
intramuscular ATP stores are small, exercise- 
associated increases in ATP turnover within 
active myocytes (up to 100 times the resting 
turnover rate) pose a major energetic challenge. 
Metabolic pathways for resynthesizing ATP are 
University, Melbourne 3000, Australia.
rapidly activated during short-term (<30-s)
sprints, primarily through substrate-level phos- 
phorylation: phosphocreatine (PCr) breakdown 
and the conversion of muscle glycogen to lactate. 
However, as ATP production becomes unable to 
match rates of utilization, a range of metabolic 
by-products accumulates. Some of these, such as 
hydrogen ions, appear to exert negative feedback 
on the pathways that produce them to prevent 
further disruption to homeostasis, whereas others, 
including adenosine monophosphate (AMP) and 
organic phosphate, stimulate energy sensors, such
as the 5'-AMP-activated protein kinase, to maintain
cellular homeostasis by regulating anabolic
and catabolic pathways, thereby ensuring a bal- 
ance between energy supply and demand. 

Sporting activities lasting several minutes to 
several hours and performed either as a steady 
state (e.g., marathon running) or with intermit- 
tent high-intensity bursts (e.g., team sports) are 
fueled principally by the oxidation of intra- 
muscular glycogen and, to a lesser extent, lipids, 
whereas the mobilization of extramuscular sub- 
strates [plasma glucose from the liver and gut 
and free fatty acids (FFAs) released from adipo- 
cytes] becomes more important as exercise dura- 
tion increases. Training enhances the metabolic 
flexibility of the myocyte, enhancing the size 
of substrate pools and the capacity to rapidly 
switch between carbohydrate (CHO)- and fat- 
based fuels to meet the demands of the work- 
ing muscles. 

The location- or fiber-specific depletion of 
muscle glycogen stores is often associated with 
fatigue, and since the introduction of the per- 
cutaneous needle biopsy technique to exercise 
science in the 1960s (5), numerous investiga- 
tions have examined strategies to promote the 
storage of glycogen before or between exercise 
bouts. The key factor in the synthesis of this 
macromolecule is the quantity of dietary CHO 
consumed (6), but because maximal hourly syn- 
thesis rates are equivalent to ~5% of the size of 
normalized stores, athletes need to plan adequate 
time as well as sufficient CHO intake to increase 


Box 1. Move over, muscle: The brain’s the boss! 


The brain and central nervous system (CNS) are implicit in skilled tasks and events requiring concentration and
decision-making. Only recently, however, have we recognized their role in the performance 
of even simple locomotor events, including strategies around pacing. A century ago, Bainbridge 
wrote, “There appear, however, to be two types of fatigue, one arising entirely within the central 
nervous system, the other in which fatigue of the muscles themselves is superadded to that of 
the nervous system” (71). Despite this early insight, sports nutrition has evolved with a bias 
toward studying peripheral mechanisms of fatigue and their role in performance, possibly 
because of the opportunities provided by available research tools (72). Exceptions are noted; 
hypoglycemia has long been recognized as a cause of fatigue during endurance sports (73), 
and brain astrocytes are now known to have labile glycogen stores (74). Furthermore, we now 
explain the ergogenic benefits of caffeine through central roles (reduced perception of effort 
and increased neural recruitment of muscle fibers) rather than a metabolic origin (muscle 
glycogen sparing because of increased availability and oxidation of plasma FFAs) (75). 

An intriguing theme in modern sports nutrition involves the CNS and nutrients that can 
enhance performance without even being absorbed (76). This interest emerged from the 
recognition that exogenous CHO intake could improve ~1-hour cycling time trial performance, 
even though muscle substrate (glycogen) availability is not rate limiting for this event (76). 
The same performance benefits were detected when the mouth was simply rinsed with a 
glucose or maltodextrin solution (77), exposing receptors in the oral cavity to CHO and 
stimulating reward centers in the brain to increase pace or work output (78). The “oral sensing” 
benefits of CHO have been shown to be robust and repeatable when undertaken throughout an 
event, offering new performance nutrition strategies and a different range of targeted sports 
(76). Although this science is still in its infancy, evidence shows benefits of oral sensing from 
other tastants (e.g., fluid and caffeine), and effects have been reported for menthol (perception 
of cooling), quinine [activation of the autonomic nervous system and/or corticomotor excitability 
for brief events (79)], and capsaicin, acetic acid, cinnamaldehyde, and other plant chemicals that 
are known transient receptor potential channel activators and may prevent exercise-associated 


muscle cramps (80). 


16 November 2018 



ILLUSTRATION: V. FALCONIERI BASED ON L. M. BURKE AND J. A. HAWLEY


glycogen to levels commensurate with the demands 
of the upcoming session (6). Factors reducing 
the efficiency of glycogen synthesis from a given 
CHO intake include muscle damage and in- 
adequate energy intake, whereas other factors 
(training status, the severity of glycogen deple- 
tion, intake within the hours following strenuous 
exercise, and co-ingestion of protein) can increase 
synthesis rates or absolute stores (6). The resting 
glycogen concentrations in endurance-trained 
muscle are higher than those in sedentary in- 
dividuals; furthermore, glycogen supercompen- 
sation (“CHO loading”) can be achieved in trained 
muscle with as little as 24 to 48 hours of reduced 
activity and high intakes of CHO [10 to 12 g per 
kilogram of body mass (BM) per 24 hours] (6). 
For at least 50 years, such techniques have been 
used by athletes to enhance their performance 
in endurance events in which glycogen deple- 
tion would otherwise occur (7). 
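
As a rough worked example of the figures quoted above (CHO loading at 10 to 12 g per kilogram of BM per 24 hours, and maximal hourly glycogen synthesis of ~5% of normalized stores), the following Python sketch converts body mass into a daily intake range and gives a lower bound on repletion time. The function names and the 70-kg athlete are illustrative assumptions rather than part of the cited guidelines, and the time estimate assumes the ~5% hourly figure holds constantly, which real synthesis rates may not.

```python
def cho_loading_range_g(body_mass_kg, low_g_per_kg=10.0, high_g_per_kg=12.0):
    """Daily CHO intake range (grams per 24 h) for glycogen supercompensation.

    The 10 to 12 g/kg defaults are the values quoted in the text.
    """
    if body_mass_kg <= 0:
        raise ValueError("body mass must be positive")
    return body_mass_kg * low_g_per_kg, body_mass_kg * high_g_per_kg


def min_hours_to_replete(deficit_fraction, max_hourly_fraction=0.05):
    """Lower bound on hours needed to restore a glycogen deficit.

    deficit_fraction: share of normal stores that is missing (0 to 1).
    max_hourly_fraction: assumed maximal synthesis per hour, expressed as a
        share of normal stores (~5% per the text); treated as constant here.
    """
    if not 0 <= deficit_fraction <= 1:
        raise ValueError("deficit_fraction must lie between 0 and 1")
    return deficit_fraction / max_hourly_fraction


# Hypothetical 70-kg athlete with 80% of normal stores depleted.
low_g, high_g = cho_loading_range_g(70)
hours = min_hours_to_replete(0.8)
print(f"CHO loading target: {low_g:.0f} to {high_g:.0f} g per 24 h")
print(f"Minimum repletion time: about {hours:.0f} h")
```

For the hypothetical athlete this works out to 700 to 840 g of CHO per day and at least about 16 hours to restore the deficit, which illustrates why adequate time has to be planned alongside intake.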

Of course, the fuel demands of many sports 
exceed muscle glycogen storage capacity or the 
athlete’s opportunity to replete endogenous stores 
between events. Exogenous CHO (CHO consumed 
in the hours before and/or throughout exercise) 
maintains euglycemia by “sparing” hepatic glu- 
cose production (8), with blood glucose making 
an increasing contribution to rates of muscle CHO 
oxidation as glycogen stores become depleted 
(9). Strong evidence suggests that performance 
in a range of sports and exercise scenarios is 
enhanced by consuming CHO during exercise, 
with intakes targeted to the muscle’s need to 
supplement its diminishing glycogen reserves 
[30 to 60 g/hour in endurance events of up to 
2 to 3 hours duration and 60 to 90 g/hour in 
ultra-endurance events lasting 8 to 10 hours 
(10)]. Intestinal CHO absorption is likely the 
rate-limiting step in the oxidation of ingested 
CHO (11). However, to some extent this can be
overcome by “training the gut”: increasing CHO 
intake in the diet and during exercise to in- 
crease tolerance and the activity of sodium- 
dependent glucose transporter SGLT-1 (12, 13). 
The use of glucose-fructose mixtures that utilize 
different gut transporters can also increase total 
intestinal absorption and rates of muscle oxi- 
dation of ingested CHO (14). 
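
The in-exercise intake targets above can be written as a simple lookup. This is a minimal sketch that encodes only the two ranges quoted in the text (30 to 60 g/hour for events of up to 2 to 3 hours, and 60 to 90 g/hour for ultra-endurance events lasting 8 to 10 hours); the function names are hypothetical, and durations the text does not cover return None rather than an interpolated guess.

```python
def cho_intake_range_g_per_h(duration_h):
    """Return the (low, high) CHO target in g/hour, or None if not covered.

    Only the two duration bands quoted in the text are encoded.
    """
    if duration_h <= 0:
        raise ValueError("duration must be positive")
    if duration_h <= 3:
        return (30, 60)
    if 8 <= duration_h <= 10:
        return (60, 90)
    return None  # the text gives no figure for this duration


def total_cho_g(duration_h):
    """Total CHO (grams) over the whole event, as a (low, high) range."""
    rate = cho_intake_range_g_per_h(duration_h)
    if rate is None:
        return None
    low, high = rate
    return (low * duration_h, high * duration_h)


for hours in (2.5, 5, 9):
    print(hours, cho_intake_range_g_per_h(hours), total_cho_g(hours))
```

Returning None for the unlisted durations is a deliberate choice: the text supplies no values between the two bands, so the sketch leaves that gap open instead of filling it.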

The relatively large lipid stores in even the 
leanest of athletes have, understandably, intrigued 
sports scientists as a potential source of fuel for 
prolonged aerobic exercise. However, whereas 
CHO oxidation is closely geared to the energetic 
demands of the working muscles, no mecha- 
nisms exist for closely regulating the availability 
and metabolism of FFAs to the prevailing energy 
expenditure. Short-term strategies [overnight fast- 
ing or low-CHO, high-fat (LCHF) eating] have 
proven unsuccessful in enhancing performance, 
despite increasing FFA availability: The small 
increase in FFA oxidation is insufficient to replace
the contribution of CHO after near depletion
of liver and muscle CHO stores (15). An
alternative strategy (16) to potentially boost the 
use of both substrate pools involves exposure 
(5 days) to LCHF (60 to 70% fat) diets to pro- 
mote muscle retooling to enhance FFA transport 
Fig. 1. Periodized nutrition: Evolution of a nutritional practice. Commencing endurance
training with lowered muscle glycogen stores (training low) results in greater transcriptional
activation of enzymes involved in CHO and fat oxidation, as well as greater mitochondrial
biogenesis, than undertaking exercise with a normal or elevated glycogen content
(29, 30). Restricting CHO availability during the early (1 to 5 hours) postexercise recovery
period also acutely up-regulates various markers of substrate metabolism and endurance
training adaptation in skeletal muscle (45). Against this background, we formulated a
novel approach in which we can undertake high-quality, high-intensity training and then
prolong the duration of low CHO availability during recovery and subsequent aerobic exercise,
thereby potentially extending the time course of transcriptional activation of metabolic
genes and their target proteins. We have termed this practice “train high, sleep low” (45, 46).
PPAR, peroxisome proliferator-activated receptor; AMPK, 5’-AMP-activated protein kinase;
MAPK, mitogen-activated protein kinase; COX, cyclooxygenase.


and utilization, followed with restoration of 
endogenous and exogenous CHO availability 
(24 hours of high-CHO diet and CHO intake 
before and during exercise). Despite substan- 
tial increases in rates of fat oxidation after such 
protocols, benefits to endurance performance 
have been, at best, limited to specific scenarios 
or individuals (17). 

More notably, the observed reduction in the 
utilization of CHO during submaximal exercise 
(“glycogen sparing”), initially thought to be ad- 
vantageous in preserving this fuel for later ox- 
idation, was discovered to be impaired CHO 
oxidation, caused by reduced muscle glycogen- 
olysis and the down-regulation of flux through 
the citric acid cycle secondary to reduced pyruvate
dehydrogenase activity (18). In sports where
success is determined by high-intensity aerobic 
exercise, either throughout the event (such as in 
a cycling time trial or 10,000-m run) or at crit- 
ical stages [within team sports or the “break- 
aways” and finishes in marathons, Ironman 
triathlons (19), and longer cycle races (20)], the
highest sustainable rates of muscle energy turnover
require the better economy of ATP production
from CHO oxidation. Short-term fat
adaptation strategies, or even long-term adap- 
tation to ketogenic LCHF diets (80% fat, <50 g 
of CHO/day), which can increase normal rates 
of fat oxidation by two or three times (21, 22), 
are limited in application to a small range of 
sporting events in which utilization is low 
enough for muscle energy to be provided by 
fat oxidation (21, 23). To date, it appears that 
protocols that substantially increase fat oxida- 
tion also decrease metabolic flexibility by reduc- 
ing CHO substrate pools and/or the ability to 
rapidly oxidize them. The bottom line is that 
when elite athletes train for and compete in 
most sporting events, CHO fuels are the pre- 
dominant and critical substrate for the working 
muscles, and the availability of CHO (22, 24), 
rather than fat, wins gold medals. We propose 
that the increased rates of fat oxidation ob- 
served after endurance training and “train-low” 
strategies (see When less is more) are a proxy for 
an increase in mitochondrial density; for com- 
petition success, this machinery is best utilized 
by harnessing it to enhance the oxidation of CHO- 
based fuels. 



SPECIAL SECTION 


DIET AND HEALTH 


Training nutrition: A balancing act 

A reductionist view of training identifies a range 
of interdependent adaptations that permit ath- 
letes to sustain the highest rate and yield of 
energy production, optimize economy of motion, 
defend cellular homeostasis, and delay the onset 
of fatigue while simultaneously attaining the 
optimal physique and technical skills specific 
to their events (3, 25). To achieve these ends, 
elite athletes and their coaches integrate a series 
of workouts that individually target important 
competition performance traits into a periodized 
training program composed of (weekly) micro- 
cycles and (3- to 6-week) mesocycles, culminat- 
ing in targeted competition peaks within the 
(annual) macrocycle. The long-term adaptations 
to exercise training, such as those observed in 
elite athletes, result from the cumulative effect 
of the many transient increases in mRNA tran- 
scripts encoding various proteins after each acute 
exercise session (26, 27). These repeated bursts in 
mRNA expression appear to be essential to drive 
the chronic intracellular adaptive response to 
exercise training (27). 

With regard to the training diet, early nutri- 
tion guidelines promoted a singular and some- 
what static approach, focusing predominantly 
on CHO-based fuels as the major energy source 
for muscle. However, contemporary guidelines 
acknowledge differences in the requirements and 
goals of different sessions or phases of training, 
leading to a periodization of the athlete’s diet 
(2, 28). In principle, nutrient support is organized 
around each training session to maximize physi- 
ological outcomes within a framework that ad- 
dresses larger nutrition goals. Recommendations 
target total energy and fuel availability, with a 
focus on protein and CHO intakes and their 
distribution throughout the day (2). For high- 
intensity or high-quality training sessions, high 
endogenous and exogenous CHO availability is 
recommended, with the timing and intake of 
CHO matching or exceeding the muscle fuel re- 
quirements to support training quality (2, 28, 29). 
Some training sessions should develop competi- 
tion fluid and CHO intake plans, incorporating 
practice of event-specific behaviors as well as 
training the gut function needed to tolerate and 
deliver these nutrients into the bloodstream. 
CHO intake can be reduced for lighter training 
loads (2, 29); furthermore, adaptations to endur- 
ance training may be enhanced by deliberately 
commencing some sessions under conditions of 
low exogenous and endogenous CHO availabil- 
ity (28-30) (see When less is more and Fig. 1). 
As a result, daily CHO intakes typically vary from 
2 to 12 g per kilogram of BM among athletes and 
across training cycles (31).

New perspectives on protein intake target the 
daily consumption of useful amounts (~0.3 g 
per kilogram of BM) of high-quality sources at 
four to six meals or snacks, particularly during 
the 1- to 3-hour window after key training ses- 
sions (32, 33) and, perhaps, a double dose before 
sleep (34). This approach extends previous rec- 
ognition that athletes have total daily protein 
requirements in excess of the standard Recommended
Dietary Allowances to optimize the
sustained (~24-hour) increase in contraction- 
stimulated synthesis of muscle protein that 
occurs after strength or endurance exercise by 
supplying leucine to further up-regulate the mammalian
target of rapamycin complex 1 (mTORC1)
pathway (32, 33). Attention to nutrients for which 
athletes have increased requirements [such as 
water, electrolytes, and iron (35)] or risks of 
deficiency [such as vitamin D (36)] is also im- 
portant in maximizing training adaptation, es- 
pecially during training blocks undertaken in 
the heat or at high altitude (3). 
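
To make the protein guidance above concrete, the sketch below turns the quoted serving size (~0.3 g per kilogram of BM, at four to six meals or snacks) into per-meal and daily amounts. The function name and the 70-kg example are hypothetical, and the handling of the pre-sleep "double dose" is an assumption: it is treated here as one of the counted meals being doubled, which the text does not specify.

```python
def protein_plan(body_mass_kg, meals=4, dose_g_per_kg=0.3, double_presleep=False):
    """Return (grams per meal, grams per day) for evenly spaced protein servings.

    dose_g_per_kg: serving size per kilogram of body mass (~0.3 per the text).
    meals: number of protein-containing meals or snacks (text: four to six).
    double_presleep: assumption, not from the text: one of the counted
        meals (the pre-sleep one) is doubled, adding one extra serving.
    """
    if body_mass_kg <= 0:
        raise ValueError("body mass must be positive")
    if meals < 1:
        raise ValueError("at least one meal is required")
    per_meal = dose_g_per_kg * body_mass_kg
    servings = meals + 1 if double_presleep else meals
    return per_meal, per_meal * servings


# Hypothetical 70-kg athlete eating five protein-containing meals.
per_meal, per_day = protein_plan(70, meals=5)
print(f"{per_meal:.0f} g per meal, {per_day:.0f} g per day")
```

Across four to six such servings the daily total comes to roughly 1.2 to 1.8 g per kilogram of BM, before any additional pre-sleep serving is counted.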

Elite athletes straddle a thin line between 
maximizing overall training stimulus to promote 
sport-specific adaptation and remaining free of 
illness or injury. Within the framework of any 
periodized training program, each acute exercise 
session is an integral part of a long-term goal, 
with the training impulse (the sum of the in- 
tensity, duration, and frequency of sessions) being 
finely balanced to underpin optimal adaptation 
for a specific competition peak. The integration 
of nutrition goals for a specific training session 
or phase within the larger nutrition plan often 
creates tension between opposing themes. Ath- 
letes, both males and females, can develop rel- 
ative energy deficiency in sport (RED-S), with a 
mismatch between energy intake and the en- 
ergy expended in exercise leading to impairments 
in metabolic rate, bone health, protein synthesis, 
production of reproductive hormones, and performance
gains (37). Yet, RED-S can occur secondarily
to the body fat manipulation and very
high training volumes that typically underpin 
success (38); this necessitates careful implemen- 
tation and scheduling of such phases into the 
annual plan (39). Other strategies that need to 
be balanced or strategically managed include 
training with low CHO availability, which in- 
creases the inflammatory (interleukin-6) response 
to exercise, with acute disturbances to the immune
system (40) and bone metabolism (41) that
appear to persist with long-term exposure (42). 
Similarly, the use of antioxidant supplements 
acutely reduces free oxygen radical damage to 
muscle fibers but potentially dampens the train- 
ing response by interfering with redox-sensitive 
signaling pathways; this has also been seen to 
translate into a reduction in performance (43). 


When less is more 


The application of molecular biology techniques 
to exercise science has identified the complexity 
and breadth of intracellular signaling networks 
by which different exercise modes drive adaptive 
changes that underpin the athletic phenotype 
(3, 25). Altering nutrient availability, particularly 
endogenous CHO stores, selectively modulates 
gene expression and intracellular signaling with- 
in the muscle; mechanisms include alterations in 
cell osmolality and increased activity of mole- 
cules within the regulatory CHO-binding do- 
main of the AMP-activated protein kinase, as 
well as perturbations to circulating FFAs and 
hormones in concert with plasma glucose and 


Box 2. Performance in a bottle. 


Sports products represent a lucrative portion of the worldwide explosion in the manufacture 
and marketing of supplements; according to one report, sports supplements generated global 
revenue of $9 billion in 2017, with a doubling of this value forecasted by 2025 (81). Surveys 
confirm the high prevalence of sports food and supplement use among athletes, including 
greater use at higher levels of competition (82). Despite earlier reluctance, many expert groups, 
including the International Olympic Committee (83), now pragmatically accept the use of 
supplements passing a risk-benefit analysis as being safe, effective, legal, and appropriate to 
an athlete’s age and maturation in their sport. Supplements used by athletes fall into different 
categories (83): nutrient supplements for the treatment or prevention of deficiencies (e.g., iron 
and vitamin D); sports foods providing energy or nutrients when it is impractical to consume 
everyday foods (e.g., sports drinks and protein supplements); performance supplements that 
directly enhance exercise capacity; and supplements that provide indirect benefits through 
recovery, body composition management, and other goals. Despite enthusiastic marketing, only a
few products in the latter two groups enjoy robust evidence of efficacy [e.g., caffeine,
creatine monohydrate, bicarbonate, β-alanine, and nitrate (84)] (Fig. 2).

Any benefits associated with supplement use must be balanced against the expense, the 
potential for adverse outcomes due to poor protocols for use (e.g., excessive doses or inter- 
actions with other supplements), and the dangers inherent with products whose manufacture 
and marketing are less regulated than those of foods or pharmaceutical goods. There are safety 
issues around dietary supplements in general, which accounted for ~23,000 notifiable emer- 
gency department visits in the United States in 2015 (85). Elite athletes also need to consider 
that supplements have been found to contain contaminants or undeclared ingredients that are 
prohibited by the antidoping codes (86) under which they compete; these include stimulants, 
anabolic agents, selective androgen receptor modulators, diuretics, anorectics, and β2 agonists
(87). Strict liability codes mean that a positive urine test can trigger an Adverse Doping Rule 
Violation with potentially serious effects on the athlete’s career, livelihood, and reputation, 
despite unintentional intake or minute (ineffective) doses. Third-party auditing of products can 
help elite athletes make informed choices about supplement use but cannot provide an absolute
guarantee of product safety (83).
insulin concentrations (3, 29). Within their repertoire
of training nutrition strategies, athletes
can now include practices that augment adapt- 
ive processes in skeletal muscle; these include 
commencing training with low exogenous CHO 
availability (fasting overnight and/or withholding 
CHO during a session) or the more potent train- 
low strategy of deliberately commencing selected 
training sessions with lowered muscle glycogen 
stores (e.g., using a first session to deplete gly- 
cogen and then training for a second time after 
withholding CHO to prevent glycogen restora- 
tion) (29, 30). 

Although studies consistently report aug- 
mented cellular responses as a result of train- 
low strategies, the translation to performance 
enhancement has been less clear (29, 30). Early 
investigations failed to detect superior perform- 
ance outcomes; this was attributed to the over- 
emphasis of such sessions within the training 
program and their resultant impairment 
of training intensity (44). These sessions 
need to be appropriately placed into a 
periodized program to complement high- 
quality training (7). A recent, clever se- 
quencing of practices (Fig. 1) integrates 
a performance-promoting session and an 
adaptation-focused session while adding 
the benefits of a prolonged increase in 
exercise-stimulated cellular signaling and 
posttranscriptional regulation during 
glycogen-depleted recovery and exercise 
(45). In subelite populations at least, bet- 
ter integration of train-low and train-high ses- 
sions into the training sequence (Fig. 1) has 
been associated with superior performance com- 
pared with the same training undertaken with 
normal CHO availability (46). So far, however, 
this does not seem to be the case in studies 
involving elite populations (22, 47), although 
it is often incorporated into real-world training 
sessions (48). Although further studies are 
needed, part of the challenge in advancing this 
area of research is the lack of agreement with 
regard to the terminology and implementation 
of the practices involved; we have tried to address 
this in a separate commentary (7). 


Fighting fatigue: Eating to win 


Each sport has distinct features, but character- 
istics shared by all competitors are a desire to 
pace their performance to achieve the highest 
sustainable outputs or speeds and maintain 
technical proficiency, with the likelihood of a 
reduction in some performance metrics inter- 
mittently, toward the end of the event, or both. 
“Fatigue” is defined operationally as a periodic 
or sustained decline in the athlete’s ability to 
optimally perform the tasks required of their 
sport. Although fatigue is often characterized as 
muscular (decreased force or power production) 
or mental (increased ratings of perceived ex- 
ertion or loss of skill and cognitive abilities), 
there is interplay between these phenomena. 
Muscular fatigue has both peripheral (related 
to the exercising muscle) and central [related to 
the ability of the central nervous system (CNS) 
to innervate the muscle fibers] input. Although
some events require maximum performance to 
break world records or personal bests, others 
reward a superior performance relative to those 
of other competitors. The factors underpinning 
fatigue or performance power or speed are spe- 
cific to the event, the environment in which it is 
undertaken, and the individual athlete. 

Figure 2 provides a simplified summary of 
the most common fatigue factors in compet- 
itive sport to which evidence-based nutrition 
strategies can be applied to reduce or delay the 
onset. Such strategies can involve chronic pro- 
tocols that work synergistically with training to 
make the body more resilient to these factors. 
For example, in team sports involving repeti- 
tion of short (e.g., 6-s) high-intensity sprints, a 
progressive decay in speed related to the failure 
to fully recover PCr stores in the intervening 
recovery periods (>30 to 120 s) can be addressed
by supplementation with creatine monohydrate 
to increase the size of the muscle PCr pool (49). 
This may not only directly enhance match per- 
formance by altering the sprint decay profile (50) 
but also allow the athlete to train harder (i.e., 
complete more sprints during training sessions) 
to increase adaptations in other physiological 
systems. Acute pre-event strategies such as CHO 
loading (24 to 48 hours of preparation) to in- 
crease the muscle glycogen stores or glycerol- 
assisted hyperhydration (2 hours of preparation) 
to increase body water storage can enhance per- 
formance in specific events if they can increase 
the time of optimal output before the body 
reaches a critical level of glycogen depletion (51)
or fluid deficit (52), respectively. Intake of CHO 
(0) or fluid (53) during the event can also ad- 
dress these peripherally limiting factors but, 
intriguingly, may provide a benefit through a
CNS effect associated with the oral sensing of 
these nutrients (Boxes 1 and 2). 

Lastly, for real-world events, the development 
of a competition nutrition plan is challenged by 
the complexity of addressing a multitude and 
overlay of these fatigue factors, the practical 
constraints imposed by the event or the nature of 
the exercise, and the beliefs and tolerance of the 
athlete. This is covered in Box 3 and illustrated by 
recent activities around the marathon (42.2 km). 
In 2017, in a carefully orchestrated attempt to 
break the 2-hour barrier (54) and after much 
scientific banter (55), Kenyan Eliud Kipchoge 
came within 25 s of a sub-2-hour performance. 
Kipchoge already had two of the three “success 
factors” for prolonged endurance events: high
aerobic power and the ability to run at a very high 
proportion of his aerobic capacity for prolonged 
periods without losing metabolic control (56). 
His attempt, albeit outside the International 
Association of Athletics Federations rules, tar- 
geted mainly the third factor: running economy 
(achieving the highest speed for the lowest oxy- 
gen cost). Strategies to improve economy in- 
cluded running on a flat motor racing track 
without sharp corners to preserve speed, run- 
ning in aerodynamic formation behind other 
runners, using a car-mounted time clock to pro- 
vide a windshield as well as pacing assistance, 
and wearing shoes developed to return 4% extra 
energy via carbon-fiber inserts (57). 
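
To put the attempt in numbers, the following minimal Python sketch converts the marathon distance quoted above (42.2 km) into average speed and pace for two finishing times. The 2:00:25 time is a reading of "within 25 s of a sub-2-hour performance", and the function names are illustrative only.

```python
MARATHON_KM = 42.2  # marathon distance quoted in the text


def average_speed_kmh(distance_km: float, time_s: float) -> float:
    """Average speed in km/hour for a distance covered in time_s seconds."""
    return distance_km / (time_s / 3600.0)


def pace_s_per_km(distance_km: float, time_s: float) -> float:
    """Average pace in seconds per kilometer."""
    return time_s / distance_km


two_hours_s = 2 * 3600          # the 2-hour barrier
attempt_s = two_hours_s + 25    # assumed finishing time of 2:00:25

for label, t in (("2:00:00 barrier", two_hours_s), ("2:00:25 attempt", attempt_s)):
    speed = average_speed_kmh(MARATHON_KM, t)
    pace = pace_s_per_km(MARATHON_KM, t)
    print(f"{label}: {speed:.2f} km/h, {pace:.1f} s/km")
```

Both times correspond to roughly 21 km/hour, and the gap between them is less than 1 s per kilometer, which gives a sense of how small the targeted gains in economy are.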

Nutritional strategies were also used to en- 
hance economy, and future improvements are 
likely. The beneficial effects of beetroot juice, 
a popular performance aid providing a supple- 
mental source of inorganic nitrate, are 
mediated through the enhancement of 
exercise economy: A secondary nitric 
oxide (NO)-generating pathway (nitrate- 
nitrite-NO) is believed to enhance NO-
mediated increases in capillary O2 delivery
to the muscle and reduce mitochondrial 
proton leakage (58). Furthermore, despite 
current claims that ketogenic LCHF diets 
provide unlimited substrate for prolonged 
exercise (27), both century-old empirical 
data (59) and recent interventions involv- 
ing world-class race walkers (22) remind 
us that the oxidation of CHO yields ~5% more 
ATP per unit of O2 than fat. Future nutrition
strategies for the marathon may focus on increasing
CHO availability and oxidation by shifting the
mitochondrial furnace away from the use of fat.
Tactics include achieving maximally
supercompensated glycogen stores, increasing the
opportunities for aggressive in-race feedings, and 
training the gut to use multiple CHO sources to 
increase overall intestinal absorption of CHO (22). 
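
The economy argument can be made concrete with a small Python sketch. It takes only the ~5% figure from the text; the linear blend between the two fuels and the function names are simplifying assumptions for illustration, not a physiological model.

```python
CHO_ATP_ADVANTAGE = 0.05  # CHO yields ~5% more ATP per unit of O2 than fat (from the text)


def relative_atp_per_o2(cho_fraction: float, advantage: float = CHO_ATP_ADVANTAGE) -> float:
    """ATP per unit of O2 relative to pure fat oxidation.

    cho_fraction is the share of O2 used to oxidize CHO (0 = all fat, 1 = all CHO).
    The linear blend is an assumption made for this sketch.
    """
    if not 0.0 <= cho_fraction <= 1.0:
        raise ValueError("cho_fraction must be between 0 and 1")
    return 1.0 + advantage * cho_fraction


def relative_o2_cost(cho_fraction: float, advantage: float = CHO_ATP_ADVANTAGE) -> float:
    """O2 needed for a fixed ATP turnover, relative to pure fat oxidation."""
    return 1.0 / relative_atp_per_o2(cho_fraction, advantage)


for fraction in (0.0, 0.5, 1.0):
    print(f"CHO fraction {fraction:.1f}: relative O2 cost {relative_o2_cost(fraction):.3f}")
```

Under these assumptions, relying entirely on CHO lowers the O2 cost of a given ATP turnover by a little under 5% compared with relying entirely on fat.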


Elite athletes are different 


Scrutiny of the evidence base for current sports 
nutrition guidelines reveals that the individuals 
who contribute blood, sweat, and tears to sci- 
entific investigations are at best well trained, 
often male, and almost always subelite. Inter- 
ventions with world-class athletes are rare: By 
definition, such athletes are few in number, and 
they are generally disinclined to interrupt suc- 
cessful training or nutrition programs or submit 
to invasive experimental techniques for the sake 
of science. It is reasonable to ask, therefore, 
whether the results of studies on nonelite pop- 
ulations apply to their elite counterparts. Issues 
include application of the intervention to the 
specific scenarios in which elite athletes train or 
compete, the inability of underpowered studies 
to detect small but worthwhile differences or 
changes in performance that could alter the 
outcomes of elite sport, and the translation of 
putative mechanisms to athletes who undertake 
substantially larger volumes of specialized train- 
ing and potentially possess favorable genetic 
traits (60). In relation to some nutrition interventions,
evidence suggests a diminished response
in elite competitors. For example, beetroot
juice supplementation appears to be less effective
in achieving economy or performance improvements
in elite athletes (61); explanations
for nonresponsiveness to this supplement in 
higher-caliber athletes include their different 
muscle fiber composition and the legacy of 
physiological adaptations attained through extensive
training, such as greater activity of the primary
arginine-NO pathway (62, 63). Nevertheless,
because this pathway is oxygen dependent, sce- 
narios can be identified in which elite athletes 
could benefit from the alternative
(oxygen- and pH-independent) nitrate-derived 
NO production, justifying beetroot juice sup- 
plementation; these include training or compet- 
ing at high altitude and competing in sports 
involving small muscle groups, such as the arms, 
in which lower blood flow increases the likelihood 
of local hypoxia and acidosis (62, 63). 


Fig. 2. Nutrition can beat competition fatigue. Many factors that commonly cause fatigue (a periodic or sustained decline in the athlete’s ability to
optimally perform) in sporting events can be addressed by nutritional strategies that reduce the effects of these factors or delay their onset. [Figure title: Fatigue Factors]


Burke et al., Science 362, 781-787 (2018)


The challenge remains to determine whether
elite athletes are successful because, or in spite,
of their nutrition practices. There are few studies
of such groups or individuals, although elite East
African athletes who have dominated middle-
distance and distance running for the past dec-
ades have been the subject of scientific inquiry (64, 65).
Their dietary patterns include consistencies with
current athlete guidelines [high CHO intakes
(~60 to 80% of energy) but regular use of train-
ing in a fasted state to achieve train-low sessions],
as well as inconsistencies with either the guidelines
or typical practices of other elite athletes
[reliance on vegetable (80 to 90% of diet) rather 
than animal food sources, very limited food va- 
riety, distribution of energy to a small number 
of meals in the day, and chronic periods of low 
energy availability]. Case histories of scientist- 
devised approaches to performance for elite 
competitors, such as “weight making” for a
professional boxer (66) or a complex nutrition
plan followed by the winner of the 3-week Giro 
d'Italia cycling race, do not necessarily allow 
firm and reproducible conclusions about the 
benefits of these strategies but show how prac- 
tices can be achieved within the complexities 
of the sporting environment (see Box 3 and 
Fig. 3). In the latter case, the competition plan 
manipulated BM (by slight energy restriction
and the use of a low-residue diet to reduce gastrointestinal
contents) according to the benefits
of being lighter on hilly sections and fluctuated 
daily energy and CHO intakes according to the 
estimated metabolic cost of each stage (67). No- 
tably, the decisive day of this race, involving a 
solo ride over a challenging mountainous ter- 
rain, was achieved with a nutrition plan provid- 
ing 6663 kcal and 18.9 g of CHO per kilogram 
Fig. 3. Perspectives on the evidence base for elite athlete practices. Developing an evidence base for the nutritional practices of elite athletes
requires acknowledgment that specific answers to research questions and interpretations of guidelines are needed. [Adapted from (60)] 




Box 3. Specificity and practicality require bespoke solutions. 


The practical implementation of nutrition strategies by athletes in 
real-world settings confounds the establishment of an evidence base 
by traditional research methods and the development of generalizable 
(and uncontroversial) guidelines. In most sports, performance is limited 
by a number of interdependent factors, typically addressed by a co- 
ordinated plan. We often consider several independently valuable nu- 
trition strategies in combination, despite the potential for redundancy, 
amplification, attenuation, and competition between effects. The in- 
dividual benefits of fluid and CHO replacement for performance in the 
heat are additive (88), caffeine is less effective when CHO intake is 
also used to attenuate the performance decline during prolonged 
exercise (89), and combining caffeine and bicarbonate supplementa- 
tion impairs the benefits of the former because of gastrointestinal 
disturbances (90). However, because it is impractical to investigate 
the many permutations and combinations of evidence-based nutrition 
strategies (90), the overall effects on performance are unknown. 

Environmental conditions and competition schedules add further 
practical challenges. Premier championships can be hosted under
“hostile” conditions (such as high altitude in the 1968 Mexico City 
Olympic Games or heat in the 2019 Doha Athletics World Champion- 
ships) or with unusual timetables (such as late-night swimming at the 
2016 Rio Olympic Games to coincide with primetime television in the 
United States). Thus, athletes often need bespoke strategies for dif- 
ferent iterations of the same event. Lastly, sport involves rules, logistical 
considerations, and cultures that dictate opportunities for nutrient intake 
before, during, and between events (91). Some provide adequate oppor- 
tunities for beneficial intake (e.g., basketball players drink during sub- 
stitutions and time-outs). However, conditions in other events, such as 
weight-division sports, encourage substantial pre-event dehydration and 
energy restriction to “make weight” (92). Soccer prohibits breaks or access 
to fluids during each half, and the practical challenge of drinking while
running at ~21 km/hour in a marathon limits the volumes ingested (68). 

It is important to consider whether the traditional conduct and evaluation 
of scientific research adequately inform elite athletes. In many areas of 
our lives, we are content with generalizable truths (strategy X is good) 
and guidelines (we should all implement strategy X by doing Y). Figure 3 
illustrates a hierarchy of types of scientific evidence. The types of studies 
that provide the strongest or highest-quality evidence (such as randomized 
controlled trials) are extremely hard to achieve with high-caliber competitors 
or may provide generic information inappropriate for a specific task. Lack of 
appreciation of these concepts has caused angst within the sports science 
community and an unfair dismissal of the integrity of its outputs. 

Sports science was criticized in an assessment by epidemiologically trained 
scientists (93, 94). Although valid methodological issues were raised, the 
analysis failed to appreciate that sports scientists working in elite sport 
typically seek highly context-specific information (Fig. 3). Evidence-based but 
bespoke solutions for small numbers of individuals require special designs and 
research tools; the validity, reliability, and sensitivity of measurements are critical. 
We must consider repeatability in study design or case history approaches to 
account for small sample sizes. Even individual responsiveness to interventions 
may vary. New or refined statistical approaches may be required (60, 90). 

Controversy regarding guidelines for fluid intake during sport ex- 
emplifies a lack of appreciation for context. Critics of current guidelines 
for an individualized approach (95) who argue that athletes should be 
told to drink only when thirsty during events (96) fail to recognize that 
opportunities for fluid intake are often beyond the athlete's control and 
unrelated to need. Therefore, it is reasonable to develop a bespoke plan 
for specific events that optimizes opportunities to consume fluid and 
CHO before and throughout the event to integrate gut comfort, fuel 
needs, a tolerable fluid deficit, and thirst management. 
(a total of 1.3 kg, equivalent to ~85 slices of bread),
with race supplies being provided by domestique 
teammates and a support crew at planned in- 
tervals to avoid the need for the cyclist to carry 
the weight burden (67). We can never know how 
much this plan contributed to the rider’s even- 
tual success. Equally, we need to intellectualize 
that athletic success can be achieved in the face 
of apparently suboptimal practice. For example, 
that the winner of an elite marathon incurred a 
loss of 10% of BM over the race (68) fails to dis- 
prove that hypohydration impairs performance; 
rather, it demonstrates that this athlete was faster 
than other competitors on that day and poten- 
tially was best able to tolerate the conditions. 
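
The figures quoted for the decisive stage can be cross-checked with simple arithmetic. The Python sketch below uses only the numbers given in the text (6663 kcal, 18.9 g of CHO per kilogram, 1.3 kg of CHO, ~85 slices of bread) plus one labeled assumption: the conventional energy value of about 4 kcal per gram of carbohydrate. Function names are illustrative.

```python
# Values quoted in the text for the decisive stage
ENERGY_KCAL = 6663.0
CHO_PER_KG_G = 18.9      # g of CHO per kg of body mass
CHO_TOTAL_G = 1300.0     # "a total of 1.3 kg" (rounded in the source)
BREAD_SLICES = 85        # "~85 slices of bread"

# Assumption: conventional energy value for carbohydrate (not stated in the text)
KCAL_PER_G_CHO = 4.0


def implied_body_mass_kg(total_g: float, per_kg_g: float) -> float:
    """Body mass implied by a total intake and a per-kilogram intake."""
    return total_g / per_kg_g


def cho_energy_share(total_g: float, energy_kcal: float,
                     kcal_per_g: float = KCAL_PER_G_CHO) -> float:
    """Fraction of total energy supplied by CHO."""
    return total_g * kcal_per_g / energy_kcal


def cho_per_slice_g(total_g: float, slices: int) -> float:
    """CHO content per slice implied by the bread equivalence."""
    return total_g / slices


print(f"Implied body mass: {implied_body_mass_kg(CHO_TOTAL_G, CHO_PER_KG_G):.1f} kg")
print(f"CHO share of energy: {cho_energy_share(CHO_TOTAL_G, ENERGY_KCAL):.0%}")
print(f"CHO per slice of bread: {cho_per_slice_g(CHO_TOTAL_G, BREAD_SLICES):.1f} g")
```

Because the 1.3-kg total is rounded, the results are approximate: the quoted values imply a rider of roughly 69 kg, with CHO supplying close to four-fifths of the day's energy and about 15 g of CHO per slice of bread.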

Although elite athletes can learn from sports 
science, many lessons have also flowed in the 
opposite direction. Sports nutrition recommendations
have often been updated when practices
observed among elite athletes were found
to be beneficial. For example, caffeine guidelines 
for sport changed when the flat cola beverages 
consumed by elite cyclists toward the end of 
prolonged races (~1 to 2 mg of caffeine per 
kilogram at the onset of fatigue) were found to 
be as effective as the “scientifically proven” 
protocols (6 to 9 mg/kg taken 1 hour pre-event) 
(69). Ammunition to update guidelines for CHO 
intake during prolonged (>2.5 hours) events 
came from observations that the intakes of many 
elite cyclists and triathletes (~90 g/hour) were 
higher than the earlier recommendations (30 
to 60 g/hour) and correlated with success (70). 
Clearly, future outcomes will be best achieved 
with a two-way interaction between sports sci- 
entists and elite athletes and their coaches. 
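
To show what these per-kilogram and per-hour recommendations mean in absolute terms, here is a minimal Python sketch. The 70-kg athlete and the 3-hour event are hypothetical values chosen for illustration; the dose ranges are those quoted in the text.

```python
def dose_range_mg(body_mass_kg: float, low_mg_per_kg: float, high_mg_per_kg: float):
    """Absolute dose range (mg) for a body mass and a per-kilogram range."""
    return (body_mass_kg * low_mg_per_kg, body_mass_kg * high_mg_per_kg)


def cho_total_g(rate_g_per_hour: float, duration_h: float) -> float:
    """Total CHO (g) consumed at a constant hourly rate."""
    return rate_g_per_hour * duration_h


BODY_MASS_KG = 70.0  # hypothetical athlete
EVENT_HOURS = 3.0    # hypothetical prolonged (>2.5 hours) event

late_race = dose_range_mg(BODY_MASS_KG, 1, 2)   # cola-style late-race intake
pre_event = dose_range_mg(BODY_MASS_KG, 6, 9)   # earlier pre-event protocol
print(f"Caffeine, late race: {late_race[0]:.0f} to {late_race[1]:.0f} mg")
print(f"Caffeine, pre-event: {pre_event[0]:.0f} to {pre_event[1]:.0f} mg")

for rate in (30, 60, 90):  # g/hour: earlier recommendations and observed elite intake
    print(f"CHO at {rate} g/hour over {EVENT_HOURS:.0f} hours: "
          f"{cho_total_g(rate, EVENT_HOURS):.0f} g")
```

For this hypothetical athlete, the late-race caffeine dose is several times smaller than the pre-event protocol, whereas the observed elite CHO intake is 1.5 to 3 times the earlier recommendation.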

In the final analysis, modern sports nutrition
offers a feast of opportunities to assist elite 
athletes to train hard, optimize adaptation, stay 
healthy and injury free, achieve their desired 
physique, and fight against fatigue factors that 
limit success. Although there will be challenges 
and changes to sports nutrition guidelines as 
they evolve beyond the frontiers of current knowl- 
edge and practice, we can be excited to know that 
sports science in many guises contributes to the 
outcomes that delight and amaze us from our 
sofas and the grandstand. 
TROPICAL CYCLONES
Warm water and big winds 


Edited by Stella Hurtley 


Tropical storm Nate
caused extensive
damage in Costa Rica 
in October 2017. 


The 2017 North Atlantic hurricane season was highly active, with six major storms, nearly two
standard deviations above the normal number. Three of those storms made landfall over the 
Gulf Coast and the Caribbean, causing terrible damage and loss. Why was the season so fierce? 
Murakami et al. used a suite of high-resolution model experiments to show that the main cause 
was pronounced warm sea surface conditions in the tropical North Atlantic. This effect was distinct
from La Niña conditions in the Pacific Ocean that were involved in other years. It remains unclear
how important anthropogenic forcing may be in causing such increased hurricane activity. -HJS 


Science, this issue p. 794 


The science of art
advancement

Art appreciation is highly subjec- 
tive. Fraiberger et al. used an 
extensive record of exhibition and 
auction data to study and model 
the career trajectory of individual 
artists relative to a network of 
galleries and museums. They 
observed a lock-in effect among 
highly reputed artists who started 
their career in high-prestige 
institutions and a long struggle for 
access to elite institutions among
those who started their career at 
the network periphery. —BJ 


Science, this issue p. 825 


Innovating to be 
nondisruptive 


Insights into the architecture 
and stoichiometry of mem- 
brane complexes have grown 
with advances in cryo-electron 
microscopy and native mass 
spectroscopy. However, most
of these studies are not in the 
context of native membrane. 
Chorev et al. released intact 
membrane complexes directly 
from native lipid membrane 
vesicles into a mass spectrom- 
eter. They analyzed components 
of the Escherichia coli inner 
and outer membranes and the 
bovine mitochondrial inner 
membrane. For several identi- 
fied complexes, they found a 
stoichiometry that differs from 
published results and, in some 
cases, confirmed interactions
that could not be characterized 
structurally. —VV 

Science, this issue p. 829 


A programmable type 
of CRISPR system 


CRISPR-Cas9 systems have been 
causing a revolution in biology. 
Harrington et al. describe the 
discovery and technological 
implementation of an additional 
type of CRISPR system based on 
an extracompact effector protein, 
Cas14. Metagenomics data,
particularly from uncultivated 
samples, uncovered the CRISPR- 
Cas14 systems containing all the 
components necessary for adap- 
tive immunity in prokaryotes. At 
half the size of class 2 CRISPR 
effectors, Cas14 appears to target
single-stranded DNA without 
class 2 sequence restrictions. By 
leveraging this activity, a fast and 
high-fidelity nucleic acid detec- 
tion system enabled detection 

of single-nucleotide polymor- 
phisms. —SYM 


Science, this issue p. 839 


Patterned fiber formation 
The ability of liquid crystalline 
materials to order spontaneously 
has driven many innovations, 
from display technologies to 
extremely tough polymer fibers. 
Cheng et al. exploited this pre- 
ponderance toward long-range 
ordering to direct the growth of 
nonliquid crystalline polymers 
into sheets of highly ordered 
fibers. Small changes to the pro- 
cessing conditions could be used 
to tweak the arrangement of the 
liquid crystals to generate a wide 
range of polymer mats or sheets
for potential use in sensing or
filtration applications. —MSL 
Science, this issue p. 804 


DNA binding as an 
anti-inflammatory 


Mice that lack the gene encoding 
8-oxoguanine DNA glycosylase 1 
(OGG1) show resistance to
inflammation. This enzyme binds 
to sites of oxidative DNA damage 
and initiates DNA base excision 
repair. Visnes et al. developed 
a small-molecule drug that 
acts as a potent and selective 
active-site inhibitor that stops 
OGG1 from recognizing its DNA
substrate (see the Perspective 
by Samson). The drug inhibited 
DNA repair and modified OGG1 
chromatin dynamics, which 
resulted in the inhibition of 
proinflammatory pathway genes. 
The drug was well tolerated by 
mice and suppressed lipopoly- 
saccharide- and tumor necrosis 
factor-α-mediated neutrophilic
inflammation in the lungs. —STS 
Science, this issue p. 834; 
see also p. 748 


Chemically reversible 
hydrogels 


The dynamic reorganization
of some cellular biopolymers
in response to signals has
inspired efforts to create artificial 
materials with similar properties. 
Freeman et al. created hydrogels 
based on peptide amphiphiles 
that can bear DNA strands that
assemble into superstructures
and that disassemble in response 
to chemical triggers. The addition 
of DNA conjugates induced tran- 
sitions from micelles to fibers and 
bundles of fibers. The resulting 
hydrogels were used as an extra- 
cellular matrix mimic for cultured 
cells. Switching the hydrogel 
between states also switched 
astrocytes between their reactive 
and naive phenotypes. —PDS 


Science, this issue p. 808 


Impact crater under ice 
Ancient meteorite impact cra- 
ters have been found across the 
surface of Earth. Kjær et al. performed
an ice-penetrating radar
analysis of the Hiawatha Glacier 
in northwest Greenland and dis- 
covered an impact crater under 
Earth's ice sheets. The oldest
ice in this crater is debris-ridden
or heavily disturbed, suggesting 
that the impact postdates initia- 
tion of the ice sheet. Sediments 
carried by a river draining out of 
the Hiawatha Glacier included 
grains that were physically 
shocked in an impact by a rela- 
tively rare iron meteorite. Models 
suggest that the meteorite must 
have been on the scale of a kilo- 
meter wide. The crater may have 
formed relatively recently during 
the Pleistocene. —PJB 

Sci. Adv. 10.1126/sciadv.aar8173 (2018). 


Lung cancer search 
and destroy 


Like many cancer types, lung 
cancer is easier to treat when it 
is detected in its early stages. 
Scafoglio et al. discovered that 
a glucose transporter called 
sodium-dependent glucose 
transporter 2 is specifically 
found in early-stage lung tumors. 
They used a receptor-specific, 
radiolabeled tracer to perform 
positron emission tomogra- 
phy to identify early tumors. 
Furthermore, a class of diabetes 
drugs called gliflozins, which 
target the same receptor, effec- 
tively targeted these lung tumors 
in mouse models. —YN 

Sci. Transl. Med. 10, eaat5933 (2018). 


GALAXIES 


Edited by Caroline Ash 
and Jesse Smith 


An infrared image 
of the Large 
Magellanic Cloud 


Big dwarfs have little dwarfs 


Galaxies grow hierarchically: Small galaxies merge into
bigger ones. Large galaxies like our Milky Way are sur- 
rounded by dozens of satellite dwarf galaxies, which are 
still in the process of merging. Those satellite galaxies 
should themselves be accompanied by even smaller 
dwarfs, but there is little observational evidence for this. 
Kallivayalil et al. examined 32 recently discovered dwarf gal- 
axies by comparing astrometric data from the Gaia mission 
to simulations. They show that at least four of the dwarfs are 
associated with the Large Magellanic Cloud, the largest satellite
galaxy of the Milky Way. This is consistent with standard
cosmology. —KTS 
Astrophys. J. 867, 19 (2018).


A new immune
syndrome identified 


The causes of later-onset 
immune deficiencies are elusive, 
but the symptoms can be 
distressing. A new syndrome
that leads to global immune dys- 
regulation has been discovered 
independently in two unrelated 
patients, one from Australia and 
the other in Japan. Using whole- 
genome sequencing, Cardinez 
et al. found that both patients 
had a mutation in a gene called
inhibitor of nuclear factor kappa- 
B kinase subunit beta (IKBKB). 
The mutation causes the 
destruction of white blood cells 
known as lymphocytes, leading 
to excessive inflammation and 
recurrent infections. CRISPR- 
Cas technology was used to 
precisely engineer the same 
mutation into mice and generate 
immunodeficiency similar to that 
observed in human patients. This 
discovery highlights the value of 
rare-disease research and offers 
CONSERVATION


Hybrid history 
Hybridization between species has long
presented a problem for the identifica- 
tion of biological species boundaries 
and, more recently, for establishing 
conservation priorities. Although 
the strict biological definition of a species 
states that it is reproductively isolated, 
many clearly defined species regularly 
hybridize, to the point that new species may 
emerge. Despite this, classifying hybrids 
and identifying their conservation status 
has been problematic. Pacheco-Sierra et al. 
attempted to address this challenge in the 
case of two species of crocodile (American 
and Morelet’s), which have been hybridizing 
for millions of years. Although more “pure” 
populations of each species exist, there 
is considerable hybridization throughout 
their range of overlap. Hence, conservation 
focus on only the nonhybridized popula- 
tions would exclude a range of natural, and 
presumably adaptive, hybrids and millions of 
years of diversity. -SNV 


Front. Ecol. Evol. 6, 138 (2018). 


The American crocodile 
(shown here) has been 
hybridizing with the Morelet’s 
crocodile for millions of years. 


hope to patients with conditions 
that have escaped diagnosis. 
—PNK 


J. Exp. Med. 215, 2715 (2018). 


SYNTHETIC BIOLOGY 
Controls for a synthetic 
RNA circuit 


Gene therapy generally relies on 
delivering DNA into cells along 
with strategies to control its 
expression. Synthetic messen- 
ger RNA (mRNA) is an attractive 
alternative gene therapy vehicle 
for applications that require 
transient protein expression, 
but controlling this expres- 

sion remains challenging. One 
approach is to add a degrada- 
tion domain to the protein, 

but this may compromise its 
proper function. Wagner et al. 
engineered small-molecule- 
responsive RNA binding proteins 
(RBPs) to control expression 

of proteins from synthetic 
mRNA. By regulating binding 

of the RBPs, they can regulate 
the timing and magnitude of 
expression of reporter proteins 
in engineered circuits that use 
either synthetic RNAs with 
base modifications (modRNA) 
designed to decrease immuno- 
genicity or self-replicating RNAs 
(replicons) that give high levels 
of expression. —VV 

Nat. Chem. Biol. 14, 1043 (2018). 


FOREST ECOLOGY 
Declining wood 
in disturbed forest 


The wood density of forest trees 
is an indicator of their potential 
to store carbon. Berenguer et al. 
compared the wood densities of 
trees and saplings in disturbed 
and undisturbed primary and 
secondary forests in eastern 
Amazonia. They found that wood 
density within disturbed forest is 
not recovering to the predistur- 
bance values, with reductions as 
great as 33% in secondary for- 
est. Key factors contributing to 
these reductions are proximity to 
forest edges, which impedes the 
recovery of high-wood-density 
tree species, and infestation by 
lianas (woody vines) in disturbed 
forest, which also have an 
inhibitory effect on tree growth. 
As forests are increasingly 
disturbed in Amazonia and other 
tropical areas, their capacity to 
act as carbon sinks will dimin- 
ish. —AMS 

J. Ecol. 106, 2190 (2018). 


METABOLIC PATHWAYS 
Roadblocks removed 
by rapid evolution 


Cellular metabolism is an inter- 
connected network of chemical 
reactions, each step catalyzed 
by dedicated enzymes. Inhibition 
or intentional removal of an 
enzyme should grind the path- 
way to a halt, a fact exploited 

by many drugs. However, given 
an adequate level of variation, 
cells inevitably circumvent such 
roadblocks. Pontrelli et al. challenged 
a strain of Escherichia coli 
with increased mutation rates 

to overcome a blocked biosyn- 
thetic pathway for the essential 
cofactor coenzyme A. Not one, 
not two, but three independent 
shortcuts around the roadblock 
were selected for, each after 
removal of the former. Such plas- 
ticity underlies the well-known 
ability of microorganisms to 
evade antibiotics. -MAF 

Nat. Chem. Biol. 14, 1005 (2018). 


ORGANIC CHEMISTRY 
A radical approach to 
asymmetric amines 


Converting aliphatic C-H bonds 
into C-N bonds is a broadly useful 
reaction in pharmaceutical 
research. A persistent challenge 
is to obtain just one of the two 
possible mirror-image products, 
or enantiomers, in this context. 
Li et al. report an odd-electron 
cobalt porphyrin complex that 
catalyzes highly enantioselective 
intramolecular C-H amination 
in a variety of substrates with 
pendant sulfamoyl azides. The 
reaction selectively targets 
carbon centers that are five 
atoms away from the activated 
nitrogen, producing six-mem- 
bered rings that can be opened 
by hydrolytic sulfonyl removal. 
Isotopic labeling and trapping 
studies implicate a radical 
mechanism. —JSY 

Angew. Chem. Int. Ed. 10.1002/ 

anie.201808923 (2018). 
CELL METABOLISM 
Mitochondrial serine 
transporter identified 


One-carbon (1C) metabolism is a 
universal metabolic process that 
is required for purine synthesis 
and supports the high levels of 
proliferation in cancer cells. The 
transport of serine into mito- 
chondria supplies most of the 1C 
units needed for biosynthesis. 
Kory et al. used a genetic screen 
to identify the long-sought-after 
mitochondrial serine transporter. 
Elucidating the key step of serine 
transport is important for our 
understanding of metabolism 
and has potential implications 
for cancer treatment. —SYM 
Science, this issue p. 791 


NEUROGENOMICS 
Mapping the brain, one 
neuron at a time 


Spatial transcriptomics can link 
molecularly described cell types 
to their anatomical positions 
and functional roles. Moffitt et al. 
used a combination of single-cell 
RNA-sequencing and MERFISH 
(multiplexed error-robust fluo- 
rescence in situ hybridization) 
to map the identity and location 
of specific cell types within the 
mouse preoptic hypothalamus 
and surrounding areas of the 
brain (see the Perspective by 
Tasic and Nicovich). They related 
these cell types to specific 
behaviors via gene activity. The 
approach provides an unbiased 
description of cell types of the 
preoptic area, which are impor- 
tant for sleep, thermoregulation, 
thirst, and social behavior. —LMZ 
Science, this issue p. 792; 
see also p. 749 


NEURODEVELOPMENT 
Development of human 
brain neurons 


The earliest stages of human 
brain development are very 
difficult to monitor, but using 
induced pluripotent stem cells 
(iPSCs) can help to elucidate 
the process. Real et al. trans- 
planted neural progenitors 
derived from human iPSCs 
into the brains of adult mice. 
They used intravital imaging to 
visualize how resulting neu- 
rons grew and connected. The 
human cells produced neurons 
that integrated and developed 
synaptic networks with oscilla- 
tory activity. Dendritic pruning 
was observed and involved a 
process of branch retraction, not 
degeneration. Cells derived from 
individuals with Down syndrome, 
upon transplantation into the 
mouse brain, produced neurons 
that grew normally but showed 
reduced dendritic spine turnover 
and less network activity. —PJH 
Science, this issue p. 793 


ORGANIC CHEMISTRY 
Heterocycles meet and 
marry on phosphorus 


Metals such as palladium are 
routinely used to link together 
carbon rings in pharmaceutical 
synthesis. However, the pres- 
ence of nitrogen in both rings 
can trip up this process. Hilton et 
al. report a versatile alternative 
process in which phosphorus 
takes the place of the metal. The 
phosphorus binds successively 
to both rings at the sites oppo- 
site the nitrogen, and treatment 
with acidic ethanol then pushes 
them off, bound to each other. 
Theory implicates a five-coordi- 
nate phosphorus intermediate 
that kinetically favors coupling 
of the two nitrogen-bearing rings 
over reactions of the other all- 
carbon substituents. —JSY 
Science, this issue p. 799 


BIOHYBRID MICROBES 
Light-powered cell 
factories 


Bacteria and fungi are used 
industrially to produce com- 
modity fine chemicals at vast 
scale. Sugars are an eco- 
nomical feedstock, but many 
of the desired products require 
enzymatic reduction, meaning 
that some of the sugar must be 
diverted to regenerate the cel- 
lular reductant NADPH (reduced 
form of nicotinamide adenine 
dinucleotide phosphate). Guo 

et al. show that electrons from 
light-sensitive nanoparticles 
can drive reduction of cellular 
NADPH in yeast, which can then 
be used for reductive biosyn- 
thetic reactions. This system 
can reduce diversion of carbon 
to NADPH regeneration and 
should be compatible with many 
existing engineered strains of 
yeast. —MAF 


Science, this issue p. 813 


NANOMATERIALS 
Wafer-scale hBN 
crystalline films 


Although wafer-scale poly- 
crystalline films of insulating 
hexagonal boron nitride (hBN) 
can be grown, the grain boundaries 
can cause either scattering 
or pinning of charge carriers 

in adjacent conducting layers 
that impair device performance. 
Lee et al. grew wafer-scale 
single-crystal films of hBN by 
feeding the precursors into 
molten gold films on tungsten 
substrates. The low solubility 

of boron and nitrogen in gold 
caused micrometer-scale grains 
of hBN to form that coalesced 
into single crystals. These films 
in turn supported the growth 

of epitaxial wafer-scale films of 
graphene and tungsten disulfide. 
—PDS 


Science, this issue p. 817 


ULTRAFAST DYNAMICS 
Physics and chemistry 
in concert 


Shining a short, intense light 
pulse on a material can cause 

a transition in both its atomic 
and electronic structures. The 
dynamics of the electronic 
structure in such transitions can 
be monitored using, for example, 
time-resolved photoemission 
spectroscopy. Nicholson et al. 
observed a photoinduced metal- 
insulator transition in indium 
nanowires on a silicon surface. 
They monitored both the physics 
and the chemistry of the system 
after the initial photoexcitation 
and correlated the closing of 
the electronic bandgap with 
the rearrangement of chemical 
bonds. The results showcase 
the wealth of information that 
time-resolved tools can reveal 
about the dynamics of complex 
systems. —JS 

Science, this issue p. 821 


CELL BIOLOGY 
Cell transitions in 
pathology 


Endothelial cells line the 
vasculature. These plastic cells 
can undergo a cell fate transi- 
tion to produce mesenchymal 
cells, known as the endothelial- 
to-mesenchymal transition 
(EndMT). This transition is 
important in embryonic heart 
development and has also been 
observed in vascular patholo- 
gies, such as atherosclerosis. 

In a Perspective, Dejana and 
Lampugnani discuss the debate 
surrounding how EndMT contrib- 
utes to disease and whether it 
can be targeted to treat various 
pathologies associated with 
vascular and extracellular matrix 
dysfunction. —GKA 


Science, this issue p. 746 


CONSERVATION 

Impacts of outdoor 
artificial light 

The use of artificial light at night 
is increasing around the world, 
causing both direct emissions, 
mostly in the vicinity of the 
light, and a brightening of the 
night sky called skyglow. In a 
Perspective, Gaston explains 
that this light is changing the 
activity patterns of many animal 
species and can even affect the 
timing of budburst in temperate 
trees. Outdoor artificial light at 
night may also affect human 
sleep patterns and human 
health. The ongoing shift to 
light-emitting diode (LED) lamps 
with a broader and colder light 
spectrum is exacerbating these 
problems. —JFU 


Science, this issue p. 744 


CARDIAC DYSFUNCTION 
Breaking mitochondria 
and hearts 


Blocking the excessive mito- 
chondrial fission mediated by 
dynamin-related protein 1 (Drp1) 
that occurs after myocardial 
infarction prevents cardiac 
dysfunction from developing. 
Nishimura et al. found that the 
cytoskeletal regulator filamin 
increased Drp1 activity after 
myocardial infarction and that 
the filamin-Drp1 interaction 
was inhibited by the U.S. Food 
and Drug Administration— 
approved drug cilnidipine (see 
the Focus by Boyer and Eguchi). 
Administering cilnidipine to 
mice after myocardial infarc- 
tion reduced mitochondrial 
fission and cardiac dysfunction, 
suggesting that this drug could 
be repurposed to reduce heart 
attack-induced damage. —WW 
Sci. Signal. 11, eaat5185, 
eaav3267 (2018). 


IMMUNOGENETICS 
Fine-tuning CD8+ T cell 
responses 


Human cytotoxic CD8+ T cells 
are important for defense 
against viral infections. Boelen 
et al. investigated whether 
inhibitory killer cell immunoglob- 
ulin-like receptors (iKIRs) carried 
by patients with chronic viral 
infections affected the efficacy 
of their CD8+ T cell responses. 
Possession of an iKIR gene along 
with a gene encoding a KIR 
ligand enhanced protective and 
detrimental human leukocyte 
antigen (HLA) class I associations 
for HIV-1, hepatitis C virus, 
and human T cell leukemia virus 
type 1. Analysis of virus dynam- 
ics, in vitro survival assays, 

and mathematical modeling 
suggested that iKIR ligation 
enhances HLA associations 

by increasing T cell survival. 

In contrast to many reported 
iKIR-disease associations, these 
observations applied to all iKIRs 
and the three viral infections 
studied. —IW 


Sci. Immunol. 3, eaao2892 (2018). 
INTRODUCTION: One-carbon metabolism 
generates the one-carbon units required to 
synthesize many critical metabolites, includ- 
ing nucleotides, amino acids, and lipids. The 
pathway has cytosolic and mitochondrial 
branches, and a key step is the entry, through 
an unknown mechanism, of serine into mitochon- 
dria, where it is converted into glycine and 
formate. 

In dividing mammalian cells, the mitochon- 
drial catabolism of serine supplies most of the 
one-carbon units needed for biosynthesis. In- 
deed, many cancers rely on the one-carbon 
units generated from serine for proliferation, 
and mitochondrial serine catabolism enzymes 
are commonly up-regulated in tumors. Given 
that the entry of serine into mitochondria is a 
critical step in the generation of one-carbon 
units, it is surprising that the mitochondrial 
transporter(s) for serine remains unknown. 


RATIONALE: To seek the transporter 
responsible for serine import 
into mitochondria, we designed 
a CRISPR-Cas9-mediated genetic 
screen in human cells based on 
the likelihood that loss of mito- 
chondrial serine transport will 
reduce the proliferation of cells 
lacking the cytosolic branch of the 
one-carbon metabolism pathway. 
Therefore, we aimed to identify 
genes that are synthetic lethal with 
serine hydroxymethyl transferase-1 
(SHMT1), a key cytosolic enzyme 
of the pathway. Moreover, we reasoned 
that even if there are redundant 
mechanisms for serine 
transport, we can sensitize cells 
to its partial inhibition by lower- 
ing cytosolic serine concentrations, 
which is easily achieved by remov- 
ing exogenous serine. Thus, we 
sought genes required for the op- 
timal proliferation of cells lacking 
the cytosolic one-carbon pathway 
when cultured in serine-free media. 


Kory et al., Science 362, 791 (2018) 
We picked the human blood Jurkat and K562 
cancer cell lines for our screen because of 
their high mitochondrial one-carbon metab- 
olism activity and suitability for screening and 
transduced cells with a lentiviral single guide 
RNA (sgRNA) library that targets ~3000 meta- 
bolic enzymes, small-molecule transporters, 
and metabolism-related transcription factors. 


RESULTS: This screen yielded genes in the 
pathways for serine and purine biosynthesis 
and with known functions in one-carbon me- 
tabolism. In both cell lines, only one gene of 
unknown molecular function scored, sideroflexin 1 
(SFXN1), a multipass mitochondrial 
membrane protein. Sfxn1 was originally identified 
as the gene mutated in a mouse mutant 
with anemia and axial skeletal abnormalities, 
and is part of the sideroflexin family of proteins 
conserved throughout eukaryotes. 

SFXN1 transports serine into mitochondria and is a 
component of the one-carbon metabolism pathway. Serine is 
converted to formate in mitochondria, which is then exported 
to the cytosol to generate one-carbon units carried on tetrahydrofolate 
(THF) for nucleotide synthesis and other processes. In 
this diagram, the mitochondria are represented by STED super-resolution 
images of cells stained both for FLAG-SFXN1 and the 
outer mitochondrial membrane marker Tom20. 

In humans, SFXN1 is highly expressed in the blood, 
liver, and kidney, which are tissues with high 
one-carbon metabolism activity. STED super- 
resolution microscopy confirmed that SFXN1 
localizes to the inner and not outer mitochon- 
drial membrane, in agreement with a potential 
role as a metabolite transporter. The proliferation 
defect of SFXN1-null cells in serine-depleted 
media was completely reversed by adding formate, 
the product of the mitochondrial one-carbon 
pathway, which directly implicates an insufficient 
supply of one-carbon units in their defective 
proliferation. Notably, expression 
of an sgRNA-resistant SFXN1 cDNA restored the 
proliferation rate of the SFXN1-null cells to that of 
the wild-type cells. 

Read the full article at http://dx.doi.org/10.1126/science.aat9528 

Serine tracing experiments place SFXN1 in 
the mitochondrial branch of the one-carbon 
pathway. Like cells missing mitochondrial 
components of one-carbon metabolism, those 
null for SFXN1 are defective in glycine and 
purine synthesis and have reduced levels of 
charged folate species. Cells lacking SFXN1 
and one of its four homologs, SFXN3, have 
more severe defects, including mitochondrial 
dysfunction and being auxotrophic for glycine. 
Several human SFXN family members can 
complement SFXN1-3 double loss, as can their 
yeast and Drosophila orthologs. These results 
were confirmed when SFXN3 emerged as one 
of the top synthetic lethal genes with SFXN1 in 
the absence of exogenous glycine. To test whether 
SFXN1 can directly transport serine, we purified 
the FLAG-tagged protein from mammalian cells 
and reconstituted it into liposomes. 
Recombinant SFXN1 mediated serine 
uptake into liposomes. Conversely, serine 
uptake into mitochondria isolated 
from SFXN1-null cells was decreased. 
SFXN1 may have other physiologically 
relevant transport substrates besides 
serine, including cysteine and alanine. 


CONCLUSION: SFXN1 functions as 


a mitochondrial serine transporter in 
one-carbon metabolism. As there are 
multiple sideroflexins and their expression 
varies across tissues, SFXN1 
and its homologs may turn out to be 
important nodes for regulating the fate 
of serine in cells. Because SFXN1 is ex- 
pressed in many cancers and its ex- 
pression is likely regulated by the Myc 
transcription factor, it may also have un- 
explored roles in cancer cell growth. 


The list of author affiliations is available in the full 
article online. 

*These authors contributed equally to this work. 
†Corresponding author. Email: sabatini@wi.mit.edu 
Cite this article as N. Kory et al., Science 362, 
eaat9528 (2018). DOI: 10.1126/science.aat9528 
CELL METABOLISM 


SFXN1 is a mitochondrial serine 
transporter required for 
one-carbon metabolism 


Nora Kory, Gregory A. Wyant, Gyan Prakash, Jelmi uit de Bos, 
Francesca Bottanelli, Michael E. Pacold, Sze Ham Chan, Caroline A. Lewis, 
Tim Wang, Heather R. Keys, Yang Eric Guo, David M. Sabatini 


One-carbon metabolism generates the one-carbon units required to synthesize many 
critical metabolites, including nucleotides. The pathway has cytosolic and mitochondrial 
branches, and a key step is the entry, through an unknown mechanism, of serine into 
mitochondria, where it is converted into glycine and formate. In a CRISPR-based genetic 
screen in human cells for genes of the mitochondrial pathway, we found sideroflexin 
1 (SFXN1), a multipass inner mitochondrial membrane protein of unclear function. Like 
cells missing mitochondrial components of one-carbon metabolism, those null for SFXN1 
are defective in glycine and purine synthesis. Cells lacking SFXN1 and one of its 
four homologs, SFXN3, have more severe defects, including being auxotrophic for glycine. 
Purified SFXN1 transports serine in vitro. Thus, SFXN1 functions as a mitochondrial 
serine transporter in one-carbon metabolism. 


One-carbon metabolism uses serine to generate 
the reactive one-carbon donors, such 
as 5,10-methylene-tetrahydrofolate, required 
for many basic processes, including 
nucleotide and lipid synthesis [reviewed 
in (1-3)]. An interesting aspect of the one-carbon 
pathway is that although partially redundant 
isozymes exist in the cytosol and mitochondrial 
matrix, in most proliferating cells, the pathway 
primarily flows from the cytosol into mitochon- 
dria and back out (Fig. 1A). Cytosolic serine enters 
the mitochondrial matrix and is converted to gly- 
cine and formate, which then exits to the cytosol 
where it is used to generate the charged folates 
that serve as one-carbon donors (4) (Fig. 1A). In 
dividing mammalian cells, the mitochondrial ca- 
tabolism of serine supplies most of the one-carbon 
units needed for biosynthesis (5-9), but the cy- 
tosolic branch can compensate for its loss (8). 
Given that the entry of serine into mitochon- 
dria is a critical step in the generation of one-carbon 
units, it is surprising that the mitochondrial 
transporter(s) for serine remains unknown 
(5, 10-12). To seek such a transporter, we de- 
signed a CRISPR-Cas9-mediated genetic screen 
based on the likelihood that loss of mitochon- 
drial serine transport will reduce the prolifera- 
tion of cells lacking the cytosolic branch of 
one-carbon metabolism. Moreover, we reasoned 
that even if there are redundant mechanisms 
for serine transport, we can sensitize cells to its 
partial inhibition by lowering cytosolic serine 
concentrations, which is easily achieved by removing 
exogenous serine (fig. S1A) (13). Thus, 
we sought genes required for the optimal pro- 
liferation of cells lacking the cytosolic one-carbon 
pathway when cultured in serine-free media. 


A genetic screen for components of the 
mitochondrial one-carbon metabolism 
pathway yields SFXN1 


To implement such a screening strategy, we 
first generated human Jurkat leukemic T cells 
and K562 erythroleukemic cells null for serine 
hydroxymethyltransferase 1 (SHMT1), an isozyme 
of mitochondrial SHMT2 and a key component 
of the cytosolic one-carbon pathway that inter- 
converts serine and glycine (Fig. 1A). We chose 
Jurkat and K562 cells because they are suitable 
for screening (14, 15) and have high mitochon- 
drial one-carbon pathway activity (8, 16). We 
transduced the SHMT1-null cells with a lentiviral 
single guide RNA (sgRNA) library that targets 
~3000 metabolic enzymes, small-molecule trans- 
porters, and metabolism-related transcription 
factors (~10 sgRNAs per gene) and also con- 
tains 499 control sgRNAs (15). The transduced 
cells were cultured in RPMI media with or without 
serine, and for each gene, we generated a gene 
score by calculating the mean log2 fold-change 
in the abundance from the beginning to end of 
the culture period of all the sgRNAs targeting the 
gene (15) (Fig. 1B). We also obtained a differential 
gene score that reflects the relative importance 
of the gene in the presence or absence of serine. 
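
The scoring scheme described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sgRNA names, gene names, and read counts are hypothetical, the read-depth normalization and optional pseudocount are our choices, and the differential score is assumed to be the gene score without serine minus the gene score with serine. The published analysis (15) may differ in its details.

```python
import math
from statistics import mean


def log2_fold_changes(initial, final, pseudocount=1.0):
    """Per-sgRNA log2 fold-change in read-depth-normalized abundance."""
    total_initial = sum(initial.values())
    total_final = sum(final.values())
    changes = {}
    for guide, start_count in initial.items():
        start = (start_count + pseudocount) / total_initial
        end = (final[guide] + pseudocount) / total_final
        changes[guide] = math.log2(end / start)
    return changes


def gene_scores(changes, guide_to_gene):
    """Gene score: mean log2 fold-change of all sgRNAs targeting a gene."""
    per_gene = {}
    for guide, value in changes.items():
        per_gene.setdefault(guide_to_gene[guide], []).append(value)
    return {gene: mean(values) for gene, values in per_gene.items()}


def differential_scores(without_serine, with_serine):
    """Assumed definition: score without serine minus score with serine."""
    return {gene: without_serine[gene] - with_serine[gene]
            for gene in without_serine}


# Hypothetical read counts for two genes with two sgRNAs each.
guide_to_gene = {"a1": "GENE_A", "a2": "GENE_A",
                 "b1": "GENE_B", "b2": "GENE_B"}
start = {"a1": 100, "a2": 100, "b1": 100, "b2": 100}
end_plus = {"a1": 100, "a2": 100, "b1": 100, "b2": 100}
end_minus = {"a1": 25, "a2": 25, "b1": 175, "b2": 175}

plus = gene_scores(log2_fold_changes(start, end_plus, 0.0), guide_to_gene)
minus = gene_scores(log2_fold_changes(start, end_minus, 0.0), guide_to_gene)
diff = differential_scores(minus, plus)
print(diff)
```

In this toy data set, the sgRNAs targeting GENE_A drop out only when serine is absent, so GENE_A receives a negative differential score, the pattern expected for a gene required specifically in serine-free media.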

As expected, most genes, as well as the control 
sgRNAs, had similar scores in cells cultured under 
both media conditions (Fig. 1C). Multiple classes of 
genes behaved as predicted. For example, in both 
cell lines, the three genes in the serine synthesis 
pathway (PHGDH, PSAT1, PSPH) were required 
for proliferation in serine-free media, as were com- 
ponents of the purine synthesis pathway, which is 
downstream of one-carbon metabolism (Fig. 1D 
and fig. S1B). Established components of the mitochondrial 
one-carbon pathway, such as SHMT2 
and the mitochondrial folate transporter (MFT), 
scored differentially in Jurkat cells, as did 5,10- 
methenyltetrahydrofolate synthetase (MTHFS), 
which returns 5-formyl-tetrahydrofolate to the tetra- 
hydrofolate (THF) cofactor pool (17), in K562 cells. 

Notably, the only gene of unknown molecular 
function that scored differentially in both cell 
lines was the mitochondrial transmembrane pro- 
tein sideroflexin 1 (SFXN1) (Fig. 1, C and D, and 
fig. S1, B and C). SfrnI was originally identified 
as the gene mutated in a mouse mutant with 
anemia and axial skeletal abnormalities and is 
part of the sideroflexin family of proteins con- 
served throughout eukaryotes (78, 19). In humans, 
SFXN1 is highly expressed in the blood, liver, and 
kidney, which are tissues with high one-carbon 
metabolism activity (fig. S2, A and B). 

To follow up the screen, we generated Jurkat 
and K562 cells lacking SFXN1 alone or in combination 
with SHMT1 (fig. S1D). In the absence 
of serine, the SFXN1-null Jurkat cells proliferated 
more slowly than the wild-type or AAVS1-targeted 
control cells, and the loss of SHMT1 
exacerbated the defect, consistent with the screening 
results (Fig. 1E). The SFXN1-null K562 cells 
proliferated less well than their Jurkat counterparts, 
but SHMT1 deletion did not exacerbate 
the defect, likely because K562 cells express very 
low levels of SHMT1 to begin with (Fig. 4C). The 
addition of formate, the product of the mitochon- 
drial one-carbon pathway, completely reversed 
the slow proliferation of the Jurkat and K562 
single- and double-null cells in the serine-free 
media, directly implicating an insufficient supply 
of one-carbon units in their defective proliferation 
(Fig. 1E). Notably, expression of an sgRNA-resistant 
SFXN1 cDNA restored the proliferation 
rate of the SFXN1-null cells to that of the wild-type 
cells (Fig. 1F). SHMT2-null cells exhibited 
proliferation defects similar to, albeit more profound 
than, those of the cells lacking SFXN1 (Fig. 1E). 


Loss of SFXN1 phenocopies mutants in 
mitochondrial one-carbon metabolism 
SFXN1 localizes to the inner mitochondrial membrane 
and is predicted to have five transmembrane 
domains [(18) and our analysis with Protter (20)], 
with its N terminus in the matrix and C terminus 
in the intermembrane space (Fig. 2A) (21). As 
expected, in HeLa cells, FLAG-tagged SFXN1 
colocalized with the inner mitochondrial membrane 
protein COX4 (Fig. 2B), and endogenous 
SFXN1 was enriched in mitochondria purified 
from Jurkat and K562 cells (fig. S5B). Using 
super-resolution microscopy, we confirmed 
that SFXN1 localizes to the inner and not outer 
mitochondrial membrane (Fig. 2C). The outer 
mitochondrial membrane, which can be marked 
by Tom20 (Fig. 2C), is permeable to most 
small metabolites owing to the presence of 
the VDAC porins. Given these attributes and its 
emergence from our screen, SFXN1 was an excellent 
candidate to be a mitochondrial serine 
transporter. 

If this were the case, cells lacking SFXN1 
should have metabolic defects similar to those 
of cells missing SHMT2 or MTHFD2, the enzymes 
needed to convert serine to glycine and 
formate in mitochondria. Indeed, loss of SFXN1, 
SHMT2, or MTHFD2, but not SHMT1, caused the 
depletion of glycine in Jurkat cells, an increase 
in the serine-to-glycine ratio, and a reduction in 
the amount of de novo-synthesized glycine secreted 
into the media as measured in tracing experiments 
with labeled serine (Fig. 2, D and E, 
and fig. S3A). The null cells also had lower levels 
of the charged folate species that we were able 
to detect (5,10-methenyl-THF and 5-formyl-THF), 
and this was not secondary to a drop in THF or 
folate levels (Fig. 2F and fig. S3B). The sgRNA-resistant 
SFXN1 cDNA complemented all the 
metabolic defects of the SFXN1-null cells. 

Fig. 1. A genetic screen for [...] 
media. 1C, one-carbon; PLP, pyridoxal phosphate. (E) SFXN1-null cells have 
a proliferation defect in the absence of serine, which is rescued by the 
addition of 1 mM formate to the media. Additional loss of SHMT1 exacerbates 
the proliferation defect in the Jurkat cells (mean ± SD; n = 3; n denotes 
biological replicates; this is true for every main and supplementary figure 
except fig. S6D; ***P < 0.001, ****P < 0.0001; ns, not significant). AAVS1 
indicates control cells that were treated with an sgRNA targeting the AAVS1 
locus as described previously (14). SFXN1- and SHMT2-null K562 cells are 
designated as -/-/- as they are triploid for these genes. (F) Expression of an 
sgRNA-resistant cDNA for SFXN1 in the SFXN1-null cells restores their proliferation 
rate in serine-deficient media (mean ± SD; n = 3; **P < 0.01, ****P < 
0.0001). Two-tailed t tests were used for comparisons between groups. 

To determine if loss of SFXN1 affects path- 
ways that consume one-carbon donors, we ex- 
amined purine synthesis, as it uses large amounts of 
10-formyl-THF as a cofactor (Fig. 2G). Indeed, the 
purine synthesis intermediates 5'-phosphoribosyl- 
glycinamide (GAR), phosphoribosylamino- 
imidazolesuccinocarboxamide (SAICAR), and 
5-aminoimidazole-4-carboxamide ribonucleotide 
(AICAR) accumulated in SFXN1-null cells to 
similar extents as in cells lacking SHMT2 and 
MTHFD2 (8) (Fig. 2H and fig. S3C). In this re- 
spect, loss of SFXN1 mimicked serine starvation, 
which in wild-type cells also caused an accumula- 
tion of the intermediates (fig. S3D). Consistent 
with SFXN1 acting upstream of formate production, 
formate abolished the accumulation of the 
intermediates in the SFXN1-null cells but did not 
rescue the defects in glycine levels (Fig. 2, I and J). 

To monitor the relative contributions of the 
cytosolic and mitochondrial pathways to the 
synthesis of one-carbon units in SFXN1-null cells, 
we used a reported strategy to trace [2,3,3-²H₃]-serine 
to thymidine triphosphate (TTP) (8, 22). 
The deuterated serine will give rise to TTP shifted 
by two mass units (TTP M+2) when catabolized 
through the cytosolic pathway but only by one 
mass unit (TTP M+1) if via the mitochondrial 
pathway (Fig. 2K). In wild-type cells, the M+1 
form was the predominant species of newly syn- 
thesized TTP, as expected in cells generating most 
of their one-carbon donors through mitochondria. 
By contrast, in cells lacking SFXN1, SHMT2, or 
MTHFD2, most of the newly synthesized TTP 
was of the M+2 species (Fig. 2L), consistent with 
SFXN1 being important for the function of the 
mitochondrial one-carbon pathway. 
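
The logic of this tracing readout can be made concrete with a short sketch. The signal intensities below are hypothetical, the function name is ours, and the calculation deliberately ignores natural-isotope-abundance correction, so it illustrates only how the M+1 and M+2 signals map onto the two routes.

```python
def pathway_shares(ttp_m1, ttp_m2):
    """Split labeled TTP between the two routes of serine catabolism.

    TTP M+1 reports the mitochondrial pathway and TTP M+2 the cytosolic
    pathway. Inputs are signal intensities in any consistent unit.
    """
    labeled = ttp_m1 + ttp_m2
    if labeled <= 0:
        raise ValueError("no labeled TTP detected")
    return {"mitochondrial": ttp_m1 / labeled,
            "cytosolic": ttp_m2 / labeled}


# Hypothetical intensities: mostly M+1, as expected for wild-type cells.
wild_type = pathway_shares(ttp_m1=80.0, ttp_m2=20.0)
# Hypothetical intensities: mostly M+2, as expected when the
# mitochondrial pathway is impaired.
mutant = pathway_shares(ttp_m1=10.0, ttp_m2=90.0)
print(wild_type, mutant)
```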

To corroborate this conclusion, we used our 
metabolism-focused sgRNA library to screen for 
genes important for the optimal proliferation of 
SFXN1-null but not wild-type cells. Gratifyingly, 
among the top hits were many components of the 
cytosolic one-carbon pathway, including SHMT1, 
which was the most differentially required gene 
(Fig. 2, M and N, and fig. S3E). We conclude that 
SFXN1 is part of the mitochondrial one-carbon 
pathway and its loss, like that of established 
components, makes cells more dependent on the 
cytosolic branch of the pathway. 


SFXN1 transports serine in vitro 


To test if SFXN1 can directly transport serine, 
we purified the FLAG-tagged protein from mam- 
malian cells (fig. S4A) and reconstituted it into 
liposomes. Recombinant SFXN1 mediated serine 
uptake into liposomes (Fig. 3A). Both L- and D-serine 
competed with the transport of the labeled 
serine, as did other amino acids, including the 
structurally related amino acids alanine, cysteine, 
and glycine, whereas other metabolites did so to 
negligible extents (Fig. 3B). Neither formate nor 
citrate competed with serine transport. In vitro, 
SFXN1 transports serine with a Michaelis constant 
(Km) of ~170 µM (Fig. 3C), which suggests that 
SFXN1 can transport serine at the estimated cellular 
serine concentration of 300 µM. Consistent 
with these findings, serine uptake by mitochondria 
isolated from SFXN1-null cells was reduced 
compared to that by wild-type mitochondria, 
whereas the uptake of the structurally unrelated 
amino acid glutamate was unaffected (Fig. 3D). 
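
To put the Km in context, a back-of-the-envelope estimate is sketched below. It assumes simple Michaelis-Menten kinetics and that the in vitro Km of ~170 µM applies inside cells, neither of which is established here; the function name is ours.

```python
def fractional_rate(substrate_um, km_um):
    """Transport rate as a fraction of Vmax: [S] / (Km + [S])."""
    return substrate_um / (km_um + substrate_um)


# Estimated cellular serine concentration (300 µM) and measured Km (~170 µM).
print(round(fractional_rate(300.0, 170.0), 2))  # 0.64
```

Under these assumptions, SFXN1 would operate at roughly two-thirds of its maximal rate at the estimated cellular serine concentration.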

Because alanine, cysteine, and glycine partially 
competed with serine in the in vitro transport 
assay, we tested whether SFXN1 can also transport 
these amino acids. Indeed, SFXN1 transported 
alanine at the physiologically relevant 
concentration of 371 µM (Fig. 3E). We could not 
reliably measure cysteine and glycine transport 
by SFXN1 and suspect that further optimization 
of the assay might be necessary to determine 
whether SFXN1 can also directly transport these 
amino acids. To begin to assess a potential role 
for SFXN1 in the metabolism of these amino 
acids, we incubated cells in media with or without 
them. Although the presence or absence of 
alanine or glycine did not affect the proliferation 
of SFXN1-null cells, these cells proliferated better 
than their wild-type counterparts in media with 
low concentrations of cystine, suggesting that 
SFXN1 may play a role in intracellular cysteine 
transport (fig. S4C and fig. S5A) (see Discussion). 


Homologs of SFXN1 can compensate for 
its loss 


Cells lacking components of the mitochondrial 
one-carbon pathway are auxotrophic for glycine 
(6, 23-25), which we confirmed using our SHMT2- 
and MTHFD2-null cells (fig. S5A). Because loss 
of SFXN1 did not cause glycine auxotrophy, we 
reasoned that there must be genes that can par- 
tially compensate for it. In mammals there are 
five sideroflexins, with SFXN3 being the closest 
homolog of SFXN1 (88% protein sequence similarity) 
(Fig. 4A). Budding yeast has only one 
sideroflexin (FSF1), while Drosophila melanogaster 
has two, one most similar to SFXN1 and SFXN3 
(Sfxn1-3) and the other to SFXN2 (Sfxn2) (Fig. 4A). 
The human, fly, and yeast sideroflexins that we 
examined localized to mitochondria when ex- 
pressed in HeLa cells (Fig. 4D). 

Because Jurkat and K562 cells, like other 
commonly used cell lines, express multiple side- 
roflexins (Fig. 4, B and C), we hypothesized that 
one or more family members might partially 
compensate for the loss of SFXN1, thus explaining 
why the SFXN1-null cells are not auxotrophic 
for glycine. To explore this possibility, we used 
the metabolism-focused sgRNA library to screen 
for genes required for SFXN1-null Jurkat cells to 
proliferate in the absence of glycine. In addition 
to several genes in the mitochondrial (MTHFD2, 
SHMT2, MFT) and cytosolic (MTHFS) one-carbon 
pathways, the only other gene to score was SFXN3 
(Fig. 4E), suggesting that it has redundant functions 
with SFXN1. 
Indeed, like cells lacking SHMT2 or MTHFD2, 
cells null for both SFXN1 and SFXN3 [SFXN1&3 
DKO (double knockout) cells] did not proliferate 
in the absence of glycine (Fig. 4F). Consistent 
with these cells having a severe defect in glycine 
synthesis, the addition of formate to the glycine-free 
media did not reverse their proliferation 
defect (Fig. 4F), whereas expression of either 
SFXN1 or SFXN3 did (fig. S5, D and E). Purine 
synthesis intermediates accumulated to greater 
extents in the DKO cells than in those lacking 
only SFXN1, whereas the single or combined 
deletions of SFXN3 and SFXN2 did not affect 
these metabolites (Fig. 4G). Compared to the 
SFXN1-null and wild-type cells, the DKO cells 
proliferated slowly even in full media, sug- 
gesting that beyond experiencing one-carbon 
unit stress, these cells have limiting amounts 
of one-carbon donors and/or glycine. 

Whereas mitochondrial mass, morphology, 
and function were not affected in SFXN1-null 
cells, SFXN1&3 DKO cells did have defects in 
these parameters (fig. S6). Mitochondrial one- 
carbon metabolism, as well as serine itself, is 
needed for mitochondrial protein synthesis and, 
indeed, the DKO cells had reduced levels of mito- 
chondrially encoded proteins, likely explain- 
ing their mitochondrial dysfunction (fig. S6G) 
(26-29). Despite multiple attempts and the sup- 
plementation of full media with formate, we 
failed to isolate SFXN1&2&3 triple knockout cells, 
suggesting that such cells are not viable. How- 
ever, we were able to obtain one cell clone lacking 
SFXN1 and SFXN2 and containing low levels of 
SFXN3 as a result of an in-frame deletion (fig. 
S5C), and these cells were also unable to prolif- 
erate in the absence of glycine (Fig. 4F). 

Because SFXN1 and SFXN3 are among the 
most highly expressed sideroflexins in Jurkat 
cells (Fig. 4B), we asked if other homologs can 
compensate for them if expressed at higher lev- 
els. With the exception of SFXN4, overexpres- 
sion of any of the human sideroflexins reversed 
the glycine auxotrophy of the Jurkat SFXN1&3 
DKO cells, as did heterologous expression of 
yeast FSF1 and Drosophila Sfxn1-3 and Sfxn2 
(Fig. 4H and fig. S5F). However, besides SFXN1 
and SFXN3, only SFXN2, FSF1, and Sfxn1-3 (the 
closest Drosophila homolog of SFXN1) amelio- 
rated, to differing degrees, the defects in purine 
synthesis (Fig. 4I). These results suggest that 
serine transport is an evolutionarily conserved 
feature of the sideroflexins but that their kinetic 
properties and likely substrate specificities vary 
so that not all can support the high rate of mito- 
chondrial serine import required to fulfill the 
demand for one-carbon units of proliferating 
cells. In vitro transport assays optimized for each 
sideroflexin will be required to test this idea. 


Discussion 


Our work reveals SFXN1 as a previously miss- 
ing component of the one-carbon metabolism 
pathway that functions as a mitochondrial serine 
transporter. We propose that SFXN1 and SFXN3 
(and perhaps SFXN2) are the main mitochondrial 
serine transporters in human cells and that 
Fig. 2. Loss of SFXN1 phenocopies mutants in mitochondrial 
one-carbon metabolism. (A) Model of the predicted topology of SFXN1 in 
the mitochondrial inner membrane. Transmembrane helices are indicated 
by numbers. IMS, intermembrane space. (B) FLAG-tagged SFXN1 localizes 
to mitochondria. Wild-type HeLa cells transiently expressing FLAG-SFXN1 
were processed for immunofluorescence detection of the FLAG epitope 
(cyan) and the mitochondrial inner membrane marker cytochrome 
c oxidase subunit 4 (COX4) (magenta). The merged image shows the 
overlap of both channels in white. Scale bar is 10 µm in the full image and 
2 µm in the inset. (C) Super-resolution microscopy confirms SFXN1 
localization to the inner membrane of mitochondria. Wild-type HeLa cells 
transiently expressing FLAG-SFXN1 were processed for immunofluorescence 
detection of the FLAG epitope (magenta) and the outer mitochondrial 
membrane marker Tom20 (left panel, green) or the mitochondrial 
inner membrane marker cytochrome c oxidase subunit 4 (COX4) (right 
panel, green) and imaged by STED microscopy. Overlap of magenta and 
green channels is shown in white, and line profiles show fluorescent 
signals of each channel across mitochondria where marked by the dotted 
rectangles. Scale bars are 2 µm in the full images and 1 µm in the insets. 
(D) As in cells lacking known components of the mitochondrial 
one-carbon pathway, glycine levels are reduced and the cellular serine/glycine 
ratio is increased in SFXN1-null cells. Serine and glycine levels were 
measured by gas chromatography-mass spectrometry (GC-MS) in 
extracts from wild-type Jurkat cells or single-cell-derived control and 
knockout clones (mean ± SD; n = 3; **P < 0.01, ***P < 0.001). (E) Loss of 
SFXN1 causes a glycine synthesis defect. GC-MS was used to measure 
glycine in the culture media of wild-type Jurkat cells or single-cell-derived 
knockout clones incubated for 12 hours with 2,3,3-²H₃-serine as the only 
serine source. The glycine M+0 species is the unlabeled species. The 
glycine M+1 species is derived from 2,3,3-²H₃-serine (mean ± SD; n = 3; 
**P < 0.01, ***P < 0.001). (F) Levels of charged folate species are 
decreased in SFXN1-null cells. Metabolites were measured by LC-MS in 
extracts from wild-type Jurkat cells or single-cell-derived control and 
knockout clones (mean ± SD; n = 3; **P < 0.01, ***P < 0.001, ****P < 
0.0001). 5,10-CH⁺-THF, 5,10-methenyl-THF. (G) Schematic of the purine 
synthesis pathway, indicating steps using one-carbon units in the form of 
10-formyl-THF. GAR, 5'-phosphoribosyl-glycinamide. SAICAR, 
phosphoribosylaminoimidazolesuccinocarboxamide. AICAR, 
5-aminoimidazole-4-carboxamide ribonucleotide. IMP, inosine monophosphate. (H) The 
purine synthesis intermediates GAR, SAICAR, and AICAR accumulate in 
SFXN1-null cells. Purine synthesis intermediates were measured by LC-MS 
in extracts from wild-type Jurkat cells or single-cell-derived control and 
knockout clones (mean ± SD; n = 3; ****P < 0.0001). (I) Addition of 1 mM 
formate does not rescue glycine levels and serine/glycine ratio of SFXN1-null 
cells. Serine and glycine levels were measured by LC-MS in extracts 
from wild-type Jurkat cells or single-cell-derived SFXN1-null cells 
incubated for 24 hours in the indicated media (mean ± SD; n = 3; *P < 
0.05, **P < 0.01). (J) Addition of 1 mM formate reverses the accumulation 
Sfxn1-3 also has this function in D. melanogaster. 
SFXN1 and SFXN3 likely have other physiologically 
relevant substrates besides serine, such as 
alanine or cysteine, a notion supported by our 
in vitro transport results, the finding that cells 
lacking both have more severe proliferation defects 
than those missing established components 
of the mitochondrial one-carbon pathway, and 
the finding that cells lacking SFXN1 have a proliferation 
advantage in media containing low cystine, the 
oxidized dimer of cysteine present in RPMI media 
and taken up by cells (fig. S4C). A major use 
of cysteine is cytosolic glutathione synthesis, and 
it is possible that loss of SFXN1, and thus a reduction 
in mitochondrial cysteine import, increases 
its availability for this use, which is known 
to be limiting for cell proliferation (30). Two reports 
proposed that SFXN1 can transport citrate 
in vitro (31, 32), but the physiological relevance 
of this remains unclear because SLC25A1 is well 
established as the mitochondrial citrate carrier 
(33-35) and citrate did not compete with serine in 
our in vitro assays (Fig. 3B). Moreover, these previous 
studies used purified endogenous rather 
than recombinant protein, raising the possibility 
that the observed activity was due to a copurifying 
contaminating protein. In addition, we 
excluded another potential substrate for the 
sideroflexins, pyridoxine (36), which is a precursor 
for the pyridoxal 5'-phosphate cofactor of 
SHMT2, ALAS2, and mitochondrial transaminases, 
because the levels of pyridoxal-conjugated proteins 
were unchanged in the mitochondria of SFXN1- 
or SFXN1&3-null cells (fig. S6H). Furthermore, 
SLC25A39 is likely responsible for pyridoxal 
5'-phosphate transport into mitochondria (37). 
When overexpressed, SFXN5 only partially 
complements loss of SFXN1, and we suspect that 
its main function is not as a serine transporter, 
similarly to SFXN4, which in our experimental 
systems cannot substitute for SFXN1. It is interesting 
that budding yeast only has one sideroflexin 
(FSF1), perhaps suggesting that it has broader 
functions than its homologs in other species. 
Mice with a loss-of-function mutation in Sfxn1 
Fig. 3. SFXN1 transports serine in vitro. (A) Time course of radioactive serine uptake into 
proteoliposomes containing SFXN1. LAMP1-containing or empty liposomes were used as controls. 
Values are the averages of two replicates. Cpm, counts per minute. (B) Competition of serine 
uptake after 60 min by different metabolites at 500 µM (mean ± SD; n = 3). (C) Steady-state kinetic 
analysis of SFXN1-mediated serine transport reveals a Vmax of ~8.2 pmol/min and a Km of 
~170 µM. Velocity, as shown, was calculated as a function of the serine concentration. Each data 
point was calculated from three replicate data points. (D) Radioactive serine uptake into 
mitochondria purified from SFXN1-null cells is reduced compared to that into mitochondria purified 
from wild-type cells, whereas glutamate uptake is unchanged (mean ± SD; n = 3, **P < 0.01; 
ns, not significant). (E) Steady-state kinetic analysis of SFXN1-mediated alanine transport reveals a 
Km of ~371 µM for alanine. Each data point was calculated from three replicate data points. 


have a sideroblastic-like anemia characterized by 
iron accumulation in mitochondria (38, 39). Although 
Sfxn1 has been challenged as the causative 
gene (39), our work does provide a possible 
explanation for the phenotype. In humans, mutations 
that impair the part of the heme synthesis 
pathway that occurs in mitochondria, which 
requires glycine, cause sideroblastic anemia (40). 
Given that mitochondria make glycine from serine, 
we speculate that in the Sfxn1-mutant mice, 
an insufficient import of serine into mitochondria 
results in a decrease in glycine and thus 
heme synthesis. Iron itself is unlikely to be a direct 
substrate of SFXN1 as the mitoferrins are 
reported mitochondrial iron transporters (41). 
There are multiple sideroflexins, and their expression 
varies across tissues (fig. S2, A and B). Like 
other genes of the mitochondrial one-carbon 


of purine synthesis intermediates in SFXN1-null cells. Intermediates were 
measured by LC-MS in extracts from wild-type Jurkat cells or single-cell-derived 
SFXN1-null cells incubated for 24 hours in the indicated media 
(mean ± SD; n = 3; ****P < 0.0001; N.D., not detected). (K) Tracing 
strategy to differentiate contribution of cytosolic and mitochondrial 
pathways to cytosolic TTP synthesis. Oxidation of 2,3,3-²H₃-serine by 
SHMT2 and subsequent enzymes in mitochondria gives rise to a singly 
labeled formate species, and thus singly labeled (one mass unit heavier, 
M+1) TTP. Oxidation by SHMT1 in the cytosol gives rise to doubly labeled 
(two mass units heavier, M+2) TTP. The difference between unlabeled 
(M+0), M+1, and M+2 TTP can be resolved on a high-resolution mass 
spectrometer. The ratio of M+1 to M+2 is indicative of the contribution of 
mitochondria- versus cytosol-derived one-carbon units to nucleotide 
synthesis. Adapted from (8). TTP, thymidine triphosphate. (L) The relative 
contribution of the cytosolic and mitochondrial one-carbon pathways to 
TTP synthesis is inverted in SFXN1-null compared to wild-type cells. 
Wild-type Jurkat or single-cell-derived knockout cells were cultured for 
12 hours in media containing 2,3,3-²H₃-serine as the only serine source 
before harvesting and LC-MS analysis (mean ± SD; n = 3, ****P < 0.0001). 
(M) Genes of the cytosolic one-carbon pathway are selectively required 
for the optimal proliferation of SFXN1-null cells. Gene scores in wild-type 
cells were plotted against those in SFXN1-null cells. Genes with a 
differential gene score of <-1.5 are shown in red. (N) Serine 
hydroxymethyltransferase 1 (SHMT1) was the top hit from the SFXN1 
synthetic lethality screen. Genes were ranked according to differential 
gene score between wild-type and SFXN1-null cells. Cyto 1C metabolism, 
cytosolic one-carbon metabolism; FA, fatty acid. Two-tailed t tests were 
used for comparisons between metabolites. 
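
The tracing logic in panel (K) reduces to a simple ratio: M+1 TTP reports one-carbon units made in mitochondria and M+2 TTP reports those made in the cytosol. The Python sketch below shows one way to express this as a mitochondrial fraction of labeled TTP. The function name and the ion intensities are hypothetical, chosen only to illustrate an "inverted" contribution, and the sketch ignores natural isotope abundance correction.

```python
def mitochondrial_fraction(m1_intensity, m2_intensity):
    """Share of labeled TTP carrying a mitochondria-derived (M+1) one-carbon unit."""
    labeled = m1_intensity + m2_intensity
    if labeled <= 0:
        raise ValueError("no labeled TTP detected")
    return m1_intensity / labeled


# Hypothetical intensities (arbitrary units), not measurements from the study.
wild_type = mitochondrial_fraction(m1_intensity=80.0, m2_intensity=20.0)
knockout = mitochondrial_fraction(m1_intensity=25.0, m2_intensity=75.0)

print(f"wild type: {wild_type:.2f} mitochondrial, {1 - wild_type:.2f} cytosolic")
print(f"knockout:  {knockout:.2f} mitochondrial, {1 - knockout:.2f} cytosolic")
```

With these invented numbers the dominant source switches from mitochondria to cytosol, which is the qualitative pattern the legend of panel (L) describes.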
Fig. 4. SFXN3 and fly and yeast sideroflexin homologs can substitute 
for SFXN1 loss. (A) Phylogenetic tree of human, Drosophila melanogaster, 
and Saccharomyces cerevisiae sideroflexins. (B) mRNA levels of the 
five human sideroflexins in commonly used cell lines. RPKM (reads per 
kilobase million) levels were extracted 
from the Cancer Cell Line Encyclopedia. 
(C) Sideroflexin protein levels in 
commonly used cell lines. Cell lysates 
were equalized for total protein 
amounts and analyzed by immunoblotting 
for the levels of the indicated 
proteins. (D) FLAG-tagged sideroflexin 
homologs localize to mitochondria. 
Wild-type HeLa cells transiently 
expressing FLAG-sideroflexin homologs 
were processed for immunofluorescence 
detection of the FLAG 
(E) CRISPR-Cas9-based genetic screen 
of purine synthesis intermediates is 
exacerbated in cells lacking both SFXN1 
and SFXN3 compared to their single-knockout counterparts. The asterisk 
denotes a cell clone lacking SFXN1 and SFXN2 and with incomplete 
deletion of SFXN3. Purine intermediates were measured by LC-MS in 
extracts from the indicated cells (mean ± SD; n = 3; **P < 0.01). 
Abbreviations as in Fig. 2C. (H) Human, yeast, and Drosophila sideroflexin 
homologs, with the exception of SFXN4, rescue the glycine auxotrophy 
of cells lacking both SFXN1 and SFXN3. Single cell-derived double- 
knockout Jurkat cells were transduced with an empty vector (EV) or 
cDNAs of human, yeast, and Drosophila sideroflexin homologs. Asterisks 
denote statistically significant differences in proliferation in media 
lacking glycine between the cells expressing the empty vector and the 
sideroflexin homologs (mean ± SD; n = 3; ***P < 0.001, ****P < 0.0001). 




Kory et al., Science 362, eaat9528 (2018) 16 November 2018 




(I) Sideroflexin homologs rescue to varying degrees the purine synthesis 
defects of cells lacking SFXN1 and SFXN3. Purine intermediates were 
measured by LC-MS in extracts from wild-type Jurkat cells or the 
double-knockout Jurkat cells expressing an empty vector (EV) or cDNAs 
of human, yeast, and Drosophila sideroflexin homologs. Asterisks 
denote statistically significant differences between the cells expressing the 
empty vector and the sideroflexin homologs. Values were normalized to 
the average value of the wild-type samples in (G) because purine 
synthesis intermediates were not detected in the wild-type samples 
in this experiment (mean ± SD; n = 3, **P < 0.01; N.D., not detected; 
N.S., not significant). Two-tailed t tests were used for comparisons 
between metabolites. 
pathway, SFXN1, 2, and 3 expression is likely 
regulated by the Myc transcription factor, as 
we found 34, 20, and 14 Myc-binding sites in 
the promoters of SFXN1, SFXN2, and SFXN3, 
respectively. SFXN1 is expressed in many cancers, 
most highly in leukemias and lymphomas 
(fig. S2C). Thus, SFXN1 and its homologs may 
turn out to be important nodes for regulating 
the fate of serine in cells and also play unexplored 
roles in cancer cell growth. 


Methods summary 


CRISPR screens performed in human SHMT1- 
null Jurkat and K562 cells identified SFXN1 as 
one of the most differentially scoring genes in 
media with and without serine in both cell lines. 
The proliferation and metabolism of single-cell-derived 
SFXN1-null cells generated using CRISPR-Cas9 
were analyzed by using CellTiter-Glo assays and 
mass spectrometry-based metabolite profiling and 
compared with mutants of the known one-carbon 
metabolism genes SHMT1, SHMT2, and MTHFD2. 
The localization of SFXN1 and its homologs was 
analyzed by spinning disk confocal and stimulated 
emission depletion (STED) super-resolution mi- 
croscopy. In vitro transport assays and uptake 
assays into isolated mitochondria confirmed 
SFXN1 as a serine transporter. The redundancy 
of genes homologous to SFXN1 (the sideroflexins) 
was analyzed by using an SFXN1 synthetic lethality 
CRISPR screen, by studying double deletion cells, 
and in complementation assays. 
INTRODUCTION: A mechanistic understanding 
of brain function requires the identification 
of distinct cell types in the brain at a molecular, 
spatial, and functional level. The preoptic re- 
gion of the hypothalamus comprises multiple 
nuclei and controls many social behaviors and 
homeostatic functions. Discrete neuronal types 
within the preoptic region have been associated 
with specific hypothalamic behaviors and ho- 
meostatic controls, yet the organizational prin- 
ciples of the underlying circuits remain elusive. 
Further progress requires methods that can 
identify molecularly distinct cell types and 
map their spatial and functional organization 
in the tissue. 


RATIONALE: Single-cell RNA sequencing 
(scRNA-seq) has revolutionized the understanding 
of many tissues by allowing a systematic, 
genome-wide molecular identification of cell 
types. However, scRNA-seq requires cell dissociation, 
leading to a loss of spatial context that 
is essential to understand the cellular architecture 
of brain circuits. Image-based approaches 
to single-cell transcriptomics enable gene expression 
profiling of individual cells within their 
native tissue and offer opportunities for simultaneous 
in situ cell-type identification and spatial 
mapping, as well as functional characterization 
when combined with activity marker imaging. 
The combination of these complementary techniques 
would allow us to generate a molecular 
inventory of neuronal types while mapping their 
spatial and functional organization. 


[Figure panel titles: Single-cell RNA sequencing; Multiplexed Error Robust Fluorescence In Situ Hybridization (MERFISH); Spatial organization of cells in the preoptic region; Male and female social behaviors] 

In situ single-cell profiling reveals the molecular and cellular organization of the 
hypothalamic preoptic region. The combination of MERFISH with scRNA-seq to profile the 
gene expression of 1 million cells in situ revealed ~70 neuronal populations in the preoptic 
region, each with distinct molecular signatures and spatial organizations, providing insights 
into neuromodulatory signaling pathways. Further combination with activity marker imaging 
led to the identification of discrete neuronal types activated by key social behaviors, including 
parenting, aggression, and mating. 


RESULTS: We combined scRNA-seq and multi- 
plexed error robust fluorescence in situ hybrid- 
ization (MERFISH), a single-cell transcriptome 
imaging method, to investigate the molecu- 
lar, spatial, and functional organization of the 
mouse hypothalamic preoptic region. We pro- 
filed ~31,000 cells using scRNA-seq and imaged 
~1.1 million cells within intact tissues using 
MERFISH. Our data revealed a remarkable diversity 
of neurons in this region, comprising ~70 different 
neuronal populations, many of which were 
previously unknown. These 
neuronal types exhibited distinct neuromodulatory 
signatures and revealed a striking heterogeneity 
within cell populations that were 
previously thought to be functionally unitary. 
MERFISH measurements further allowed us to 
map the spatial organization of these neuronal 
types, determine the cellular composition of 
distinct nuclei, and provide insights into the 
functional organization of neuron populations, 
including topographical relationships that underlie 
sex hormone signaling. 

Read the full article at http://dx.doi.org/10.1126/science.aau5324 

Last, we combined MERFISH with immediate- 
early-gene expression imaging to identify spe- 
cific neuronal populations activated by social 
behaviors, including parenting, mating, and 
aggression. Several neuronal populations were 
selectively activated in each of these behaviors, 
supporting the notion that transcriptionally dis- 
tinct neuronal types control specific hypo- 
thalamic functions. We identified a core neuronal 
population activated in all animals that exhibit 
parenting, as well as cell populations differen- 
tially activated in mothers and fathers, pro- 
viding insights into how physiological state 
may affect parental behavior. Moreover, we 
identified cells associated with sexual behavior 
in males and females as well as male aggression 
toward infants and conspecific males. 


CONCLUSION: By combining MERFISH with 
scRNA-seq, we have revealed the molecular, spa- 
tial, and functional organization of neurons 
within the hypothalamic preoptic region. These 
results provide a framework for mechanistic 
investigation of behavior circuits with high molecular 
and spatial resolution and open avenues 
for identifying and mapping cell types in a diverse 
range of tissues and organisms. 
The hypothalamus controls essential social behaviors and homeostatic functions. 
However, the cellular architecture of hypothalamic nuclei, including the molecular identity, 
spatial organization, and function of distinct cell types, is poorly understood. Here, 
we developed an imaging-based in situ cell-type identification and mapping method and 
combined it with single-cell RNA-sequencing to create a molecularly annotated and spatially 
resolved cell atlas of the mouse hypothalamic preoptic region. We profiled ~1 million cells, 
identified ~70 neuronal populations characterized by distinct neuromodulatory signatures 
and spatial organizations, and defined specific neuronal populations activated during 
social behaviors in male and female mice, providing a high-resolution framework for 
mechanistic investigation of behavior circuits. The approach described opens a new avenue 
for the construction of cell atlases in diverse tissues and organisms. 


A mechanistic understanding of brain function 
requires a systematic assessment of 
cell types and their spatial organization, 
connectivity, and functional properties. 
A case in point is the preoptic region of 
the hypothalamus, which comprises multiple 
nuclei and controls essential social behaviors 
such as parenting, mating, and aggression as 
well as homeostatic functions such as thermoregulation, 
thirst, and sleep (1, 2). Because these 
are evolutionarily conserved functions, it has been 
proposed that the associated neural circuits are 
genetically defined and thus composed of transcriptionally 
distinct neuronal types (1-3). Indeed, 
several neuronal populations within the preoptic 
region, each defined by discrete molecular 
markers, have been linked to distinct behavioral 
and homeostatic functions (4-11). However, the 
number of cell types present in the preoptic re- 
gion as well as their molecular signatures, spa- 
tial organizations, and functional roles remain 
unclear, hampering our ability to investigate the 
underlying neural circuits. 
Single-cell RNA-sequencing (scRNA-seq) provides 
a powerful means for the identification 
of cell types and cell states through genome-wide 
expression profiling of individual cells, offering 
rich insights into the cellular diversity of many 
tissues, including the brain (12-15). However, 
scRNA-seq requires cell dissociation and thus 
results in the loss of the spatial context of cells 
that is critical for understanding tissue func- 
tion (5, 16). Recently, image-based single-cell 
transcriptomic approaches have been developed 
that quantify gene expression by directly imag- 
ing individual RNA molecules within intact cells 
and tissues with multiplexed fluorescence in 
situ hybridization (FISH) or in situ sequencing 
(15, 17-22). These approaches offer new oppor- 
tunities to identify cell populations within com- 
plex tissues while simultaneously mapping their 
spatial organization and uncovering their func- 
tions by combining gene expression profiling 
with imaging of activity markers, such as the in- 
duction of immediate early genes (IEGs) (22, 23). 
Among these, multiplexed error-robust FISH 
(MERFISH) detects individual RNA molecules 
with single-molecule FISH (smFISH) (24, 25) and 
uses error-robust barcoding, combinatorial label- 
ing, and sequential imaging to multiplex smFISH 
measurements, enabling transcriptome-scale RNA 
imaging of individual cells in situ (20, 26). 
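
One way to picture error-robust barcoding is nearest-codeword decoding: if every pair of valid barcodes differs in at least four bit positions, a readout with a single misread bit still has exactly one closest valid barcode, while a readout with two misread bits can be rejected. The Python sketch below illustrates only this principle. The three 8-bit barcodes, their assignment to the genes Gal, Sst, and Avp, and the function names are all hypothetical and are not the codebook used in the study.

```python
# Toy codebook: 8-bit barcodes with pairwise Hamming distance of 4.
# The barcodes and gene assignments are invented for illustration.
CODEBOOK = {
    "11110000": "Gal",
    "11001100": "Sst",
    "10101010": "Avp",
}


def hamming(a, b):
    """Number of positions at which two equal-length bit strings differ."""
    if len(a) != len(b):
        raise ValueError("barcodes must have equal length")
    return sum(x != y for x, y in zip(a, b))


def decode(measured, codebook=CODEBOOK, max_errors=1):
    """Return the gene whose barcode is within max_errors bits, else None."""
    matches = [gene for code, gene in codebook.items()
               if hamming(measured, code) <= max_errors]
    # Ambiguous or unmatched readouts are discarded rather than guessed.
    return matches[0] if len(matches) == 1 else None


print(decode("11110000"))  # exact match
print(decode("11100000"))  # one bit lost, still assigned
print(decode("11000000"))  # two bits off, rejected
```

Each bit here would correspond to one imaging round, so a molecule's barcode is read out across sequential images and then matched against the codebook.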

We developed a MERFISH-based imaging and 
analysis platform for in situ cell-type identifi- 
cation and mapping and used this approach, in 
combination with scRNA-seq, to create a cell 
atlas of the preoptic region of the mouse hypo- 
thalamus. We used scRNA-seq to catalog cell 
populations and identify their marker genes. 
We then performed MERFISH imaging of these 
marker genes together with genes of known 
functional importance to identify cell popula- 
tions and map their spatial organization in situ. 
Last, we combined MERFISH with measure- 
ments of IEG expression in order to identify 
discrete cell populations activated by specific so- 
cial behaviors—including parenting, aggression, 
and mating—in both sexes and different physio- 
logical states. 


Results 
scRNA-seq of the preoptic region 


We dissected a rostral part of the mouse hy- 
pothalamus that contains the preoptic region 
(Fig. 1A)—the medial preoptic area (MPOA) and 
surrounding nuclei (~2.5 by 2.5 by 1.1 mm, Bregma 
+0.5 to -0.6)—from adult female and male brains 
and dissociated the tissue using a custom protocol 
that improved cell survival and capture (fig. S1). 
We collected scRNA-seq profiles from 31,299 cells 
across three replicates of each sex using droplet- 
based scRNA-seq (27-29). 

We used unsupervised, graph-based, community- 
detection methods (28, 30, 31) modified by us 
(fig. S2) to cluster cells (29). This led to the de- 
lineation of major cell classes, including inhibitory 
and excitatory neurons, microglia, astrocytes, im- 
mature oligodendrocytes (newly formed oligoden- 
drocytes and oligodendrocyte progenitor cells), 
mature oligodendrocytes, ependymal cells, endo- 
thelial cells, fibroblasts, macrophages, and mural 
cells, as well as subdivisions within these cell classes 
(Fig. 1B and table S1). 

Further clustering of inhibitory neurons (15,042 
cells) and excitatory neurons (3511 cells) separately 
revealed 43 and 23 subpopulations, respectively 
(Fig. 1B; fig. S3, A and B; and tables S1 and S2). 
Hereafter, we denote excitatory and inhibitory 
neuronal clusters as e1, e2, ..., and i1, i2, ..., respectively. 
We also provide specific names for 
these clusters based on marker genes (Fig. 1, C 
and D, and figs. S4 and S5, the latter emphasiz- 
ing neuropeptide expression) (29). 

Although the majority of the identified clus- 
ters expressed either excitatory or inhibitory neu- 
ronal markers, we observed expression of the 
γ-aminobutyric acid (GABA) synthetic genes 
Gad1 and Gad2 in many excitatory neuronal 
clusters classified on the basis of expression of 
Vglut2 (Slc17a6), with Gad2 expression being 
particularly widespread (fig. S3C). By contrast, 
very few Slc17a6-positive clusters expressed the 
GABA transporter gene Vgat (Slc32a1). These 
data suggest that Slc17a6 and Slc32a1 are better 
discriminators for excitatory versus inhibitory 
neurons than Gad1 and Gad2, corroborating evidence from other 
brain areas (32). Cells in two neuronal clusters 
originally designated as inhibitory and one 
originally designated as excitatory coexpressed 
Slc17a6 (or Slc17a8, Vglut3) and Slc32a1. These 
cells were unlikely to be a clustering artifact because 
individual cells coexpressed both markers, 
nor did they correspond to doublets (29); hence, 
they potentially represent hybrid neurons capable 
of GABA/glutamate corelease, as characterized 
in the hypothalamus and a few other brain 
regions (32-34). We denote these clusters as h1, 
h2, and h3 (Fig. 1, C and D, and fig. S3C). 
To determine the gene categories that best dis- 
criminate neuronal clusters, we examined the top 
five most differentially expressed genes in each 
cluster and observed enrichment for neuropep- 
tides and molecules involved in neuromodulator 
production and transport, as well as for tran- 
scription factors, but not for neuromodulator 
(neuropeptide and hormone) receptors. Quan- 
titative analyses of enrichment profiles of these 
three gene classes among differentially expressed 
genes further support this notion (Fig. 1E, fig. 
S6, and table S3). Neuromodulator receptors did 
discriminate some clusters (for example, Npr1, 
Rxfp1, Brs3, and Drd1) (Fig. 1, C and D, and figs.
S4 and S5). However, on average, neuromodu- 
lator receptors were expressed more widely and 
at lower levels than neuromodulators and tran- 
scription factors, limiting their use as potential 
markers for functional studies. Most clusters 
were discriminated by combinations of genes 
rather than by single markers. 

Hierarchical tree analyses (29) showed that 
inhibitory neuronal clusters that express a com- 
mon neuromodulator were often grouped together 
on the tree (for example, clusters expressing Avp,
Gal, Crh, Tac1, and Sst; Fig. 1C), suggesting potential
functional or developmental commonality
among them. By contrast, neuromodulators
largely failed to group excitatory neuronal clus- 
ters (Fig. 1D). Instead, predicted locations of in- 
Fig. 1. scRNA-seq of the preoptic region in the
mouse hypothalamus. (A) Schematic of the 
preoptic region of the hypothalamus. Magenta 
boxes indicate the area dissected for scRNA-seq 
(Bregma +0.5 to −0.6). (B) t-distributed stochastic
neighbor embedding (tSNE) for all cells
and inhibitory and excitatory neurons, with cells 
colored by cluster. Numbers superimposed on 

the tSNE indicate the cluster ID. Total cell numbers 
for each tSNE plot are indicated. NFO, newly 
formed oligodendrocytes; OPC, oligodendrocyte 
progenitor cells; MO, mature oligodendrocytes. 
(C) Heat map of z-scores of expression for select 
genes within inhibitory neuronal clusters. Clusters 
are organized on the basis of the hierarchical tree 
constructed with expression in principal compo- 
nent space, with some of the genes differentially 
expressed between branches indicated (blue). The 
nomenclature of clusters uses a numeric indicator 
of excitatory or inhibitory cluster followed by one or 
two marker genes, with the first marker typically a 
neuromodulator (29). Inhibitory and excitatory 
clusters that lack a notable neuromodulator marker 
gene were designated as Gaba and Glut, respec- 
tively, with an additional marker gene to help 
differentiate among these clusters when possible. 
Cluster names are colored according to the first 
gene. Predicted anatomical locations for the clusters 
are listed on the tree, and the unlabeled lines indi- 
cate that such prediction was not possible. Thick 
black lines underscore clusters grouped by common 
neuropeptide expression. (D) As in (C) but for 
excitatory neurons. The hybrid neuronal clusters 
h1/h2 and h3 are listed in (C) and (D), respectively,
because they were initially classified as inhibitory 
and excitatory, respectively. (E) −log10(P value) for
the enrichment of gene categories in differentially 
expressed genes that mark neuronal clusters calcu- 
lated based on a gene-set enrichment analysis as 
shown in fig. S6. *P < 0.05. 




dividual clusters on the basis of spatial expression 
patterns of their marker genes observed in the 
Allen Brain Atlas (35) and our own in situ hy- 
bridization data (fig. S7) suggest that excitatory 
clusters tended to be grouped on the tree by an- 
atomical structures or nuclei (Fig. 1D). For example, 
markers of clusters e4, e2, e21, h3, and e17 defined
a node in the tree located in the PVN and adja- 
cent nuclei (MPN, PaAP, BAC, and BNST), markers 
of node-sharing clusters e13 and e7 placed these 
populations in the MnPO/AvPe/VMPO region, 
whereas markers of 12, e6, e5, and e1 placed these
cells in the MPN/MPA region (Fig. 1D) (full names 
of the nuclei described in this work are provided 
in table S4). We thus hypothesize that excit- 
atory neuron types tend to be spatially segregated 
Fig. 2. scRNA-seq identifies subdivisions of cells that express markers
previously associated with single neuronal populations. (A to C) Ex- 
pression distributions of selected marker genes and genes of interest in all 
neuronal clusters that are statistically enriched [Model-based Analysis of 
Single-cell Transcriptomics (MAST) (75), false discovery rate <0.01] in (A) 
galanin (Gal), (B) tyrosine hydroxylase (Th), or (C) Bdnf and Adcyap1.
Gene names in black indicate differentially expressed genes for each se- 


in distinct anatomical structures of the preoptic 
region, in a manner similar to the spatial segrega- 
tion of different types of excitatory neurons in var- 
ious layers of the cortex (36). Similar analysis with 
the inhibitory neuronal tree suggests that although 
some groups of clusters were defined by spatially 
restricted transcription factor expression—for 
example, Six6 marking the SCN (Fig. 1C)—such 
spatial grouping of transcriptionally similar clus- 
ters appeared to be less pronounced than with 
excitatory clusters. Additionally, transcription 
factors tended to mark groups of neuronal clus- 
ters further subdivided by neuromodulator ex- 
pression (Fig. 1, C and D), which is consistent 
with earlier reports of hypothalamic parcella- 
tion by transcription factors during early devel- 
opment (37). 


Specific neuronal clusters identified with scRNA-seq


Previous studies of the preoptic region have de- 
fined cell populations associated with the reg- 
ulation of specific homeostatic and behavioral 
functions on the basis of the expression of one 
or more marker genes (table S5). Clusters that 
express these marker combinations were iden- 
tified in our scRNA-seq data (figs. S4 and S5), 
together with many previously unknown cell 
populations. Moreover, we uncovered a high level 
of molecular heterogeneity among a number of 
previously reported singular cell types, thus par- 
titioning them into multiple distinct populations, 
as illustrated below with specific examples.
The neuropeptide galanin (Gal) has been associated
with behaviorally relevant cell populations
of the preoptic region (4, 5, 38) in the
MPOA (parenting and feeding) (5, 38) and VLPO 
(sleep) (4). Our scRNA-seq data revealed seven 
neuronal clusters that were statistically enriched 
in Gal expression, each characterized by distinct 
marker genes (Fig. 2A) validated with two-color in 
situ hybridization (fig. S7A). These clusters were 
each associated with different hormonal modu- 
lations, ranging from cluster i20:Gal/Moxd1, pre- 
dicted to lie in the sexually dimorphic nucleus 
of the POA (Fig. 1C) and expressing a wide range 
of sex steroid and neuropeptide receptors, to 
cluster e24:Gal/Rxfp1, expressing no sex steroid 
receptor (Fig. 2A). 

Second, cells that express tyrosine hydroxy- 
lase (Th), a key enzyme involved in catechol- 
amine synthesis, have been viewed as a single 
population involved in several social behaviors 
(6, 39). We identified six Th-enriched neuronal 
clusters (Fig. 2B and fig. S7B), among which 
only i16:Gal/Th and i38:Kiss1/Th expressed both 
Dopa decarboxylase (Ddc) and the vesicular mono- 
amine transporter Vmat2 (Slc18a2), genes required 
for dopaminergic function (Fig. 2B). 

Last, the neuropeptide adenylate cyclase acti- 
vating polypeptide 1 (Adcyap1) and brain-derived 
neurotrophic factor (Bdnf) have recently been 
identified as combined markers for preoptic neu- 
rons sensing warm temperature (8). Our data 
revealed nine Adcyap1- and Bdnf-enriched clusters 
(Fig. 2C). Although the warm-sensitive neurons 
lected neuronal cluster. Gene names in blue indicate inhibitory (Gad1, Gad2,
Slc32a1) and excitatory (Slc17a6) neuronal markers, as well as dopaminergic
markers (Ddc, Slc6a3, and Slc18a2). Gene names in green indicate sex hormone
receptors. The y axis on each violin plot depicts the log transformed counts 
with the range set to the 95% expression quantile of the cluster with the highest 
expression (29). The sizes of red, cyan, and yellow circles correspond to the 
cell abundance of the inhibitory, excitatory, and hybrid clusters, respectively. 


have been previously considered as inhibitory 
neurons on the basis of their functional properties 
and expression of Gad2, all nine Adcyap1- and 
Bdnf-enriched clusters identified here coexpressed 
Gad2 and Slc17a6, and only one of them also expressed
Slc32a1 (Fig. 2C), identifying these clusters
as excitatory or hybrid neurons. We further
identified one of these clusters as representing 
warm-sensitive neurons with the help of MERFISH. 
A recent study has revealed that a neuronal pop- 
ulation that controls thirst-motivated behavior 
also expresses Adcyap1 and Bdnf (0), further sup- 
porting the notion that Adcyap1 and Bdnf are 
imperfect markers for warm-sensitive cells. 


MERFISH measurements of the preoptic region


Next, we performed MERFISH measurements of 
the preoptic region (1.8 by 1.8 by 0.6 mm, Bregma 
+0.26 to -0.34), within the area characterized 
with scRNA-seq, targeting a set of 155 genes 
(Fig. 3A and table S6) (29). These genes were 
composed of two groups: (i) 85 preselected genes 
that were either known markers for major cell 
classes or relevant to neuronal functions of the 
hypothalamus, such as neuropeptides and neuro- 
modulator receptors, and (ii) 70 additional genes 
that were identified with scRNA-seq as neuronal 
cluster markers but not already included in the 85 
preselected genes. Among these 155 genes, 135 genes 
were imaged by using combinatorial smFISH with 
an error-robust barcoding scheme, as demon- 
strated previously for MERFISH (20, 26, 40). The 
Fig. 4. Neuronal clusters in the preoptic region as revealed with MERFISH. (A and B) z-scores
of expression profiles for (A) inhibitory and (B) excitatory neuronal clusters identified with MERFISH. 
Depicted are 100 random cells from each cluster. The neuronal clusters are organized on the basis of 
similarity in their expression profiles, as depicted by the dendrogram. The sizes of red, cyan, and 
yellow circles indicate the abundance of neuronal clusters, and only clusters with more than 100 cells 
are depicted. H-1 is grouped with the inhibitory clusters because it was initially classified as inhibitory 
neurons. (C) The pairwise Pearson correlation coefficients between the expression profile (in z-score) of 
the MERFISH and scRNA-seq clusters. The order of the clusters in (C) is not the same as in (A) 
and (B). (D) As in (C) but with only scRNA-seq cluster(s) most similar to each MERFISH cluster shown, 
identified as the cluster(s) with the highest Pearson correlation coefficient(s) (fig. S14 and table S9) 
(29). When multiple scRNA-seq clusters show statistically indistinguishable, highest correlation 
coefficients to a MERFISH cluster (29), all of them are indicated. scRNA-seq clusters outside the region
imaged with MERFISH, as assessed by the expression patterns of the marker genes in the Allen 
Brain Atlas (35) and our own in situ data (fig. S7) (29), are excluded from this analysis (29). (E) Same as
(D) but for clusters enriched in galanin (Gal).


remaining 20 genes were relatively short and/or 
expressed at high levels, which is challenging 
for combinatorial smFISH detection, and hence 
were measured in sequential rounds of multicolor 
FISH after the combinatorial run. The sexually 
dimorphic expression previously reported for 11 
genes (41, 42) was confirmed here (fig. S8). 
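
As an illustrative aside, the idea behind error-robust decoding in combinatorial smFISH can be sketched in a few lines. The toy 8-bit codebook, the gene labels, and the function names below are hypothetical and are not the codebook used in this work. The sketch assumes only that every pair of valid barcodes differs in at least four bits, so that a single-bit readout error can be corrected and a two-bit error can be rejected.

```python
# Toy illustration of error-robust barcode decoding (hypothetical codebook).
# Every pair of codewords below differs in exactly four bit positions, so
# any single-bit readout error still has a unique nearest codeword.
CODEBOOK = {
    "11110000": "geneA",
    "11001100": "geneB",
    "10101010": "geneC",
}


def hamming(a, b):
    """Number of positions at which two equal-length bit strings differ."""
    if len(a) != len(b):
        raise ValueError("barcodes must have equal length")
    return sum(x != y for x, y in zip(a, b))


def decode(measured, codebook=CODEBOOK, max_errors=1):
    """Return the gene whose barcode is within max_errors bits of `measured`.

    Returns None when no codeword, or more than one codeword, is that close,
    that is, when the readout cannot be assigned unambiguously.
    """
    hits = [gene for word, gene in codebook.items()
            if hamming(measured, word) <= max_errors]
    return hits[0] if len(hits) == 1 else None


if __name__ == "__main__":
    print(decode("11110000"))  # exact match
    print(decode("11100000"))  # one bit lost, still assigned
    print(decode("11000000"))  # two bits lost, rejected
```

In this toy setting a readout with one flipped bit is still assigned to its gene, whereas a readout that sits two bits away from two different codewords is discarded rather than guessed.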

We sectioned the preoptic region into 60 evenly 
spaced slices along the anterior-posterior axis 
and performed three-dimensional MERFISH im- 
aging on every fifth slice (29). Individual RNA 
molecules were clearly detected and identified 
(fig. S9), and individual cells were segmented 
based on 4’,6-diamidino-2-phenylindole (DAPI) 
and total mRNA staining (fig. S10) (29). In total, 
we profiled >400,000 cells from three to four 
replicates in naive male and female animals, 
as well as >500,000 additional cells from three 
to five replicates of animals subjected to be- 
havioral stimuli (29). MERFISH expression data 
showed high reproducibility between replicates 
(fig. S11A), good correlation with bulk RNA-seq
data of the preoptic region (43) (fig. S11B), and
a low false-detection rate (fig. S11C). For the tar- 
geted genes, MERFISH detected on average six- 
to eightfold more transcript copies per cell than 
did scRNA-seq (fig. S12, A to D), underscoring the 
high sensitivity of MERFISH. 
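
The slice sampling scheme described above can be worked through numerically. This is a sketch under stated assumptions: it takes the 0.6-mm anterior-posterior extent quoted for the imaged region, assumes that the 60 slices tile that extent without gaps, and assumes that imaging starts at the first slice. The function and parameter names are illustrative only.

```python
def imaged_slice_offsets(extent_um=600.0, n_slices=60, step=5):
    """Offsets (in micrometers from the anterior edge) of the imaged slices.

    Assumes slices of equal thickness tile the extent with no gaps and that
    imaging starts at slice 0 and continues with every `step`-th slice.
    """
    thickness = extent_um / n_slices
    return [i * thickness for i in range(0, n_slices, step)]


if __name__ == "__main__":
    offsets = imaged_slice_offsets()
    print(len(offsets))              # number of imaged slices per animal
    print(offsets[1] - offsets[0])   # spacing between imaged slices
```

Under these assumptions the scheme gives 12 imaged slices spaced 50 µm apart, which agrees with the 12 slices per animal mentioned in the legend of Fig. 5.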

We used an unsupervised, community-detection- 
based clustering approach similar to that applied 
to scRNA-seq data to identify transcriptionally 
distinct cell populations in MERFISH data (Fig. 3, 
B and C, and table S7) (29). MERFISH identified 
all major cell classes (Fig. 3, B and C), except for 
macrophages and fibroblasts, potentially because 
the corresponding marker genes were not included 
in the MERFISH gene library. The expression 
profiles of cell classes measured with MERFISH 
were strongly correlated with those determined 
by using scRNA-seq (Fig. 3D). However, the re- 
lative abundance of cells in various cell classes 
differed in the two datasets (fig. S12E). In particular,
astrocytes, endothelial cells, and ependymal
cells were depleted in our scRNA-seq data, presum- 
ably because of cell loss during tissue dissociation. 

MERFISH also provided a direct measurement 
of the spatial distribution of major cell classes. 
As expected, mature oligodendrocytes were 
enriched in the anterior commissure and the 
fornix, the major myelinated fiber tracts of the rostral
hypothalamus, whereas immature oligodendro- 
cytes, astrocytes, microglia, and endothelial cells 
were dispersed throughout (Fig. 3, E and F). 
Ependymal cells formed a single layer lining the 
more caudal aspects of the third ventricle, and 
mural cells were organized in vermiform structures 
that resemble blood vessels (Fig. 3, E and F). 
Notably, inhibitory and excitatory neurons exhib- 
ited distinct distributions (Fig. 3, E and F). Inhib- 
itory neurons, the more abundant neuronal type 
in the preoptic region, were widely dispersed across 
this region but enriched in specific posterior 
nuclei, including the BNST and MPN. By contrast, 
excitatory neurons were specifically enriched in a 
few nuclei anteriorly but became more dispersed 
posteriorly and, in agreement with previous re- 
ports (44), were depleted in the posterior BNST. 


MERFISH analyses of specific neuronal types


Clustering analyses of inhibitory neurons and 
excitatory neurons separately identified ~40 in- 
hibitory and ~30 excitatory neuronal populations 
(Fig. 4, A and B, and tables S7 and S8). We in- 
vestigated the impact of the number of genes 
used to cluster cells in MERFISH data and found 
that ~90% of the identified neuronal clusters were 
recovered by using the ~75 genes that were most 
informative among the 155 (fig. S13). Beyond this 
point, cluster recovery increased more slowly with 
the number of genes added (fig. S13). Hereafter, 
we denote excitatory and inhibitory neuronal clusters
identified with MERFISH as E-1, E-2, ... and
I-1, I-2, ..., respectively, and the one identified 
hybrid cluster as H-1. 

The expression profiles of most neuronal clus- 
ters determined with MERFISH correlated well 
with those of scRNA-seq clusters (Fig. 4C and 
fig. S14, A and B). This observation allowed us to 
infer, for each MERFISH cluster, the putative cor- 
responding or most similar scRNA-seq cluster(s), 
defined as the cluster(s) with the highest corre- 
lation coefficient(s) (Fig. 4D, fig. S14C, and table 
S9) (29), which could help expand our knowl- 
edge of the expression profiles of the MERFISH 
clusters. Similar correspondence was observed by 
using a neural network classifier (fig. S14, D and 
E). Correlations between MERFISH and scRNA- 
seq clusters were only moderately weaker than 
those between scRNA-seq clusters derived from 
bootstrapped replicates (fig. S14, F and G). Many 
MERFISH clusters had a distinct, most similar 
scRNA-seq cluster (Fig. 4D, fig. S14, and table S9). 
However, in some instances, multiple MERFISH 
clusters exhibited the highest correlation to the 
same scRNA-seq cluster; in addition, a small frac- 
tion of MERFISH clusters lacked a statistically 
significant correlation to any scRNA-seq cluster 
(Fig. 4D, fig. S14, and table S9). Both of these
scenarios suggest that some clusters identified
with MERFISH were not discriminated by scRNA- 
seq. Conversely, a small fraction of scRNA-seq clus- 
ters lacked a statistically significant correlation to 
any MERFISH cluster (Fig. 4D, fig. S14, and table 
S9), suggesting that these clusters were not iden- 
tifiable by the MERFISH gene panel or were lo- 
cated outside the MERFISH-imaged area (29). 
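
The cluster-matching step described above, in which each MERFISH cluster is paired with the scRNA-seq cluster whose expression profile correlates most strongly with it, can be sketched as follows. This is a minimal illustration and not the analysis code used in this work: the cluster names and expression values are invented toy data, each gene is z-scored across clusters within each dataset before correlation, and the statistical test used to decide whether several correlations are indistinguishable is omitted.

```python
import math


def zscore_by_gene(profiles):
    """Z-score each gene across clusters.

    profiles: dict mapping cluster name -> list of mean expression values,
    one per gene, with the same gene order in every cluster.
    """
    names = list(profiles)
    n_genes = len(profiles[names[0]])
    scaled = {name: [] for name in names}
    for g in range(n_genes):
        column = [profiles[name][g] for name in names]
        mean = sum(column) / len(column)
        sd = math.sqrt(sum((v - mean) ** 2 for v in column) / len(column))
        for name, value in zip(names, column):
            # A gene with no variation across clusters carries no signal.
            scaled[name].append((value - mean) / sd if sd > 0 else 0.0)
    return scaled


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    denom = math.sqrt(vx * vy)
    return cov / denom if denom > 0 else 0.0


def best_matches(merfish, scrna):
    """For each MERFISH cluster, return (best scRNA-seq cluster, r)."""
    m_z = zscore_by_gene(merfish)
    s_z = zscore_by_gene(scrna)
    matches = {}
    for m_name, m_vec in m_z.items():
        scores = {s_name: pearson(m_vec, s_vec)
                  for s_name, s_vec in s_z.items()}
        best = max(scores, key=scores.get)
        matches[m_name] = (best, scores[best])
    return matches


# Hypothetical mean expression of four shared genes in three clusters each.
TOY_SCRNA = {"s_a": [9, 1, 1, 2], "s_b": [1, 8, 2, 1], "s_c": [2, 1, 1, 9]}
TOY_MERFISH = {"M_1": [50, 4, 6, 10], "M_2": [5, 40, 9, 6], "M_3": [8, 5, 4, 45]}

if __name__ == "__main__":
    for name, (match, r) in best_matches(TOY_MERFISH, TOY_SCRNA).items():
        print(name, match, round(r, 3))
```

Z-scoring within each dataset removes the overall difference in detection sensitivity between the two methods, so the comparison depends on which genes are relatively high or low in a cluster and not on absolute counts.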

As a specific illustration, MERFISH identified 
10 clusters enriched in Gal expression, some show- 
ing one-to-one correspondences to Gal-enriched 
scRNA-seq clusters (Fig. 4E; fig. S15, A and B; and 
table S9). We also observed instances in which 
two Gal-enriched MERFISH clusters putatively 
corresponded to the same Gal-enriched scRNA-seq 
cluster (for example, I-14 and I-16 to i16) (Fig. 4E 
and table S9), suggesting that MERFISH resolved 
subpopulations within the scRNA-seq cluster. In- 
deed, we identified two subsets of cells within the 
scRNA-seq cluster i16, each respectively express- 
ing markers of the MERFISH clusters I-14 and 
I-16 (fig. S15, C and D). The calcitonin receptor
(Calcr)- and bombesin receptor (Brs3)-positive 
I-14 and the Th-positive I-16 were found to be 
differentially activated in specific social behav- 
iors, as described later (table S9), supporting the 
resolution of these cells into two distinct popula- 
tions. We also observed a similar resolution of i8 
into I-7 and I-31 (Fig. 4E; fig. S15, E and F; and 
table S9). A Gal-enriched scRNA-seq cluster could 
also be split into Gal-enriched and non-Gal-enriched 
MERFISH clusters [for example, i20 into Gal-enriched
cluster I-34 and non-Gal-enriched clusters
I-2 and I-32 (fig. S15, G and H, and table S9)].

Examination of the MERFISH or scRNA-seq 
clusters that were not discriminated by the other 
method showed several trends. Some of the 
MERFISH clusters not detected with scRNA-seq 
had relatively low abundance and thus might not 
be sufficiently represented in our scRNA-seq data, 
which profiled 4% as many neurons as we did in 
MERFISH. Some of the MERFISH clusters not 
discriminated by scRNA-seq had lowly expressed 
marker genes, which may not be reliably detected 
with scRNA-seq. Conversely, some scRNA-seq clus- 
ters not identified with MERFISH had marker 
genes that were not included in the MERFISH 
gene library. Some of the extremely low-abundance 
MERFISH or scRNA-seq clusters that lack cor- 
respondence may not represent well-identified 
clusters. These results thus demonstrate the com- 
plementary nature of MERFISH and scRNA-seq 
and an increased ability to characterize cell pop- 
ulations when both approaches are combined. 
Nevertheless, some clusters still exhibited hetero- 
geneity in gene expression associated with dis- 
tinct spatial locations (fig. S16), suggesting either 
spatial gradients in gene expression within the 
same cluster or the presence of unresolved cell 
subpopulations. 


Spatial organization of specific neuronal cell types


Next, we examined the spatial distributions of 
individual neuronal clusters (Fig. 5A and figs. S17 
and S18) within the framework of major anatomically
defined nuclei of the preoptic region
Fig. 5. The spatial organization of neuronal clusters in the preoptic region. (A) Spatial distribution
of example neuronal clusters that are localized (top and middle) or dispersed (bottom).
Depicted are six of the 12 slices imaged from a female mouse. Colored markers indicate cells of 
the specified neuronal clusters, and gray markers indicate all other neurons. Nuclei boundaries 
depicted in light gray are drawn according to (45) and aligned to the tissue slices according to the 
locations of landmarks, such as the anterior commissure, fornix, and ventricle. The 0, 100, 200,
300, 400, and 500 µm labels indicate the distance from the anterior position (Bregma +0.26).
(B) Illustration of major hypothalamic nuclei spanning the imaged region and colored according to
legend on the right (45). Nuclei abbreviations are as defined in Fig. 3F, and additionally, BAC, bed
nucleus of the anterior commissure; LPO, lateral preoptic area; MPA, medial preoptic area;
PS, parastrial nucleus; StHy, striohypothalamic nucleus; SHy, septohypothalamic nucleus;
ACA, anterior commissure; Fx, fornix; 3V, third ventricle. Bregma locations are listed on top and
the map at Bregma −0.22 is duplicated. (C) Summary of nuclei in which inhibitory (blue) or
excitatory (green) neuronal clusters are enriched. Translucent horizontal bars indicate nuclei that 
contain only inhibitory (blue) or excitatory (green) clusters. Vertical pink bars highlight clusters 
primarily enriched in single nuclei. BNST-mal, BNST, medial division, anterolateral part; BNST-mv, 
BNST, medial division, ventral part; BNST-p, BNST, posterior part. (D and E) Analysis of spatial 
mixing of distinct neuronal clusters. We define the complexity of the neighborhood surrounding any 
given neuron as the number of distinct neuronal clusters present within that neighborhood, and 
the purity of that neighborhood as the fraction of all cells within the given neighborhood that are part 
of the most abundant cluster. Probability distributions of the complexity (D) and purity (E) of the 
100-µm-radius neighborhood surrounding any given neuron are depicted.
Fig. 6. Spatial and molecular organization of neuronal clusters enriched
in genes relevant to social behaviors. (A) Expression distributions of
selected marker genes and genes of interest for neuronal clusters enriched in
aromatase (Cyp19a1). Expression distributions are calculated as in Fig. 2.
(B) Spatial distributions of neuronal clusters depicted in (A). Two of the
12 slices from a female mouse sample are depicted. Nuclei boundaries depicted
in light gray are as defined in Fig. 5A. (C and D) As in (A) and (B) but for
clusters enriched in estrogen receptor α (Esr1). (E) Schematic showing the
nuclei spanned by individual clusters, as indicated by the color subdivisions
of the rectangles, colored identically to the nuclei abbreviations listed below.
The nuclei abbreviations are as defined in Figs. 3F and 5B. I-17 is not colored
because it was found at the edge of our imaged region and falls outside of
the boundaries of the nearest imaged nuclei, the VLPO (table S9). (F) Average
overlap fraction between aromatase-enriched clusters and Esr1-enriched
clusters for all measured animals. Cluster I-24 is enriched in both aromatase
and Esr1 and listed only once. (G) Models of autocrine and paracrine signaling.
Circulating testosterone (gray T) can activate cells expressing androgen
receptor (AR) or, in cells expressing aromatase, can be converted to estrogen
(orange E). Autocrine: In cells co-expressing aromatase and Esr1, estrogen
produced in these cells can activate estrogen receptor (ERα) in the same cells.
Paracrine: Estrogen produced by aromatase-enriched cells can activate ERα in
nearby cells enriched with Esr1. (H) Comparison of the fraction of cells that
belong to the specified neuronal clusters (I-15 or I-2) for all male (blue) and
female (red) replicates as a function of the anterior-posterior position of the
slices. Above each panel is the spatial distribution of the cluster in four slices
from a single female (red) and male (blue) replicate. (I and J) As in (A) and (B)
but for clusters enriched in oxytocin receptor (Oxtr). (K and L) As in (A) and (B)
but for a cluster enriched in gonadotropin releasing hormone 1 (Gnrh1).
MERFISH revealed 8, 15, and 19 aromatase-, Esr1- and Oxtr-enriched clusters,
respectively, with only the seven most enriched clusters depicted for each.


as depicted in Fig. 5B (45). About 30% of the 
MERFISH clusters were enriched primarily in a 
single nucleus (Fig. 5C, pink shading)—for ex- 
ample, cluster I-5 primarily in the VLPO and cluster 
E-9 in the PVA (Fig. 5, A to C)—whereas approx- 
imately half of the clusters were distributed over 
a few (two to four), often physically contiguous 
nuclei (Fig. 5C, unshaded clusters)—for example, 
cluster I-3 in the BNST-mv, PaAP, PS, and SHy 
and cluster I-12 in the StHy, MPA, and MPN 
(Fig. 5, A to C). This anatomical dispersion may 
reflect a similar function of the same cell type 
across distinct nuclei or a developmental rela- 
tionship of spatially distinct cells. By contrast, a 
small fraction of the neuronal clusters were dispersed
and not enriched in any given nucleus,
such as I-21 and E-22 (Fig. 5A). Whereas most 
nuclei were populated by both excitatory and 
inhibitory neurons, the PVA and BAC only con- 
tained excitatory clusters (Fig. 5C, green shaded 
row), and the BNST-p and BNST-mv contained 
only inhibitory clusters (Fig. 5C, blue shaded rows), 
which is consistent with previous observations 
of high expression of Slc17a6 and Slc32a1 in
these regions, respectively (35, 44). 

Neuronal clusters of the preoptic region ap- 
peared highly intermixed, with multiple clusters 
occupying any given nucleus. To quantify the 
degree of intermixing, we calculated the neighborhood
composition for each neuron. This analysis
showed that each neighborhood contained
multiple clusters and was typically not dominated 
by a single cell population (Fig. 5, D and E). 
These direct spatial measurements allowed us 
to provide an anatomy-based taxonomy for the 
identified neuronal clusters, except for the dis- 
persed clusters, which we named on the basis 
of marker genes (table S9). The putative cor- 
respondence between these MERFISH clusters 
and scRNA-seq clusters allowed us to further 
assess the spatial locations of scRNA-seq clusters
and compare them with our earlier predictions
shown in Fig. 1, C and D. In nearly all cases, the
predicted locations matched or partially over- 
lapped those of the corresponding MERFISH 
clusters (table S9), lending additional support 
to our earlier observations on the spatial re- 
lationship between transcriptionally similar clus- 
ters (Fig. 1, C and D). However, we caution that 
the predicted scRNA-seq cluster locations rep- 
resent rough approximations because of the 
relatively low resolution of available in situ 
hybridization data and suffer from occasional 
ambiguity in the spatial patterns of some marker 
genes. Moreover, because of the remaining het- 
erogeneity in some of the identified scRNA-seq 
and MERFISH clusters, the putative correspon- 
dence may only represent similarity between 
subsets of cells within these clusters. 
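
The neighborhood complexity and purity used above to quantify intermixing (defined in the legend of Fig. 5, D and E) can be illustrated with a short sketch. It is not the code used in this work: it assumes two-dimensional cell coordinates in micrometers within a single slice, counts the central neuron as part of its own neighborhood, and uses a brute-force search that is practical only for small examples. The function name and the toy coordinates are hypothetical.

```python
import math
from collections import Counter


def neighborhood_stats(cells, radius=100.0):
    """Complexity and purity of the neighborhood around every cell.

    cells: list of (x, y, cluster_label) tuples, coordinates in micrometers.
    Returns one (complexity, purity) pair per cell, where complexity is the
    number of distinct clusters within `radius` and purity is the fraction
    of neighborhood cells that belong to the most abundant cluster.
    The cell itself is counted as part of its neighborhood (an assumption).
    """
    stats = []
    for x0, y0, _ in cells:
        labels = [label for x, y, label in cells
                  if math.hypot(x - x0, y - y0) <= radius]
        counts = Counter(labels)
        complexity = len(counts)
        purity = max(counts.values()) / len(labels)
        stats.append((complexity, purity))
    return stats


# Hypothetical positions: three nearby cells from two clusters, one far away.
TOY_CELLS = [(0, 0, "A"), (10, 0, "A"), (20, 0, "B"), (500, 0, "C")]

if __name__ == "__main__":
    for cell, (complexity, purity) in zip(TOY_CELLS, neighborhood_stats(TOY_CELLS)):
        print(cell, complexity, round(purity, 2))
```

A highly intermixed region gives high complexity and low purity for most neurons, whereas a region dominated by one population gives purity close to 1.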


Spatial and molecular organization of socially relevant cell populations


Neuromodulators, hormones, and associated sig- 
naling pathways play critical roles in hypothalamic 
functions, but analyses with cellular resolution 
have been limited owing to the low expression 
level of many of the corresponding receptors. 
MERFISH enabled us to examine the distribu- 
tion of these genes throughout the preoptic 
region, providing functional insights into the 
associated cell populations. 

Sex steroid hormones are essential to the de- 
velopment and modulation of social behaviors 
and reproduction. We examined the distribution 
of enzymes and receptors essential for steroid 
hormone signaling in the preoptic region. The 
enzyme aromatase (Cyp19a1) converts testosterone
to estrogen and thus modulates steroid function
(39). MERFISH revealed aromatase-enriched
clusters with distinct repertoires of sex hormone 
receptors (Fig. 6, A and B). Several of these clusters
(such as I-2, I-13, I-24, I-32, and E-12) expressed
both androgen receptor (Ar) and estrogen receptor
α (Esr1), suggesting that in these cell populations,
circulating testosterone can be converted
into estrogen and thus affect gene expression in 
a cell-autonomous manner through Esr1 activa- 
tion. In addition, the aromatase-enriched clus- 
ters E-12 and I-2 substantially overlapped with 
locations of several Esr1-enriched clusters (Fig. 6, 
C to F), suggesting that estrogen synthesized by 
these aromatase-expressing cells may also act in 
a paracrine manner on cells of the nearby Esr1- 
enriched clusters, in addition to the autocrine sig- 
naling mode described above (Fig. 6G). 

Some Esr1-enriched and aromatase-enriched
clusters exhibited differences in cell abundance
of varying extent between males and females. For
example, the Esr1-enriched cluster I-15 showed an
appreciable enrichment in female animals, where- 
as the aromatase-enriched cluster I-2 showed a 
more modest male enrichment (Fig. 6H). I-2 over- 
lapped with the sexually dimorphic nucleus of the 
preoptic area (SDN-POA), and its marker gene 
Cplx3 was co-expressed with Moxd1 [a canonical
SDN-POA marker (46) not in our MERFISH gene 
library] in cells of the SDN-POA and BNST (fig. 
S19). However, I-2:BNST/StHy/MPN spatially ex- 
tended beyond the boundaries of the SDN-POA, 
Fig. 7. Subdivisions of neuronal populations expressing Gal or Adcyap1 revealed with MERFISH.
(A) MERFISH subdivides galanin- and Adcyap1-expressing cells into multiple transcriptionally and
spatially distinct clusters. Color subdivision of the rectangles shows the nuclei spanned by individual
clusters, colored identically to the nuclei abbreviations listed on the right. The nuclei abbreviations
are as defined in Figs. 3F and 5B. (B) Expression distributions of selected marker genes and genes
of interest for all neuronal clusters enriched in galanin (Gal). Expression distributions are calculated as in
Fig. 2. (C) Spatial distributions of all inhibitory and excitatory Gal-enriched clusters. (D and E) As
in (B) and (C) but for Adcyap1- and Bdnf-enriched clusters. The seven most enriched of the
14 Adcyap1- and Bdnf-enriched clusters are shown. (F) In situ hybridization images of cFos (red),
Sncg (green), and overlay of an anterior slice of the preoptic region taken from a heat-stressed
animal. The blue boxed region is magnified and shown on the right. Sncg is a marker for the
scRNA-seq cluster e13 that corresponds to the MERFISH cluster E-3 (table S9).


suggesting that it may contain unresolved sub- 
populations, which may partially mask a sexual 
dimorphism in SDN-POA cell populations. 

We next considered clusters enriched in the 
expression of oxytocin receptor (Oxtr), an impor- 
tant modulator of social behaviors that exerts its 
effects broadly throughout the brain (47). Although 
the low expression level of Oxtr has previously 
made it challenging to identify oxytocin targets, 
the high sensitivity of MERFISH allowed us to 
detect enrichment of Oxtr in multiple clusters 
(Fig. 6, I and J). For example, the Oxtr-enriched 
BNST cluster I-24 coexpressed multiple sex hor- 
mone receptors as well as the neuropeptide Tac2 
implicated in social isolation stress (48), aggres- 
sion, and fear (48, 49), suggesting the involvement 
of oxytocin signaling in these functions. This cluster
was specifically activated after pup-directed
aggression by virgin males as described below, 
corroborating studies that implicate the BNST 
in this function (50). This highlights a seemingly 
paradoxical role for Oxtr in agonistic pup en- 
counters versus its known roles in affiliative be- 
haviors (47). Oxtr was also found in the VLPO 
cluster I-5, implying that oxytocin might have a 
role in the modulation of VLPO functions such 
as sleep or temperature sensing (4, 51-53).

The high throughput of MERFISH measure- 
ments allowed us to identify some extremely rare 
cell types. GnRH-expressing cells (E-30) repre- 
sent a rare cell population dispersed within the 
preoptic area and basal forebrain that integrates 
and orchestrates peripheral and central aspects 
of reproduction (54-56). Only a few cells were
found to express appreciable GnRH in our scRNA-
seq data and did not form a distinct cluster. By
contrast, MERFISH identified a GnRH-enriched 
cluster (Fig. 6, K and L), which expressed re- 
markably low levels of Esr1 and Ar (Fig. 6K), 
suggesting that GnRH neurons may receive in- 
direct feedback from circulating hormones within 
the hypothalamic-pituitary-gonadal (HPG) axis, 
potentially through synaptic input from hormone-
responsive cell types such as Kisspeptin (Kiss1)-
expressing cells (57).


Partition of previously defined cell types 
into multiple cell populations 


Our MERFISH data also partitioned a number of 
previously reported single cell types (for example,
Gal-expressing and Adcyap1-expressing neurons)
into multiple distinct cell populations (Fig. 7A). 
We observed 10 Gal-enriched MERFISH clusters, 
several of which were scattered across multiple 
nuclei, such as cluster I-14: MPA/MPN/StHy (Fig. 7, 
A to C). I-14 was strongly activated during parent- 
ing, as shown below, revealing that molecularly 
and functionally defined cell types can spread 
across multiple nuclei. 

Similarly, MERFISH identified 14 clusters 
enriched in Adcyap1 and Bdnf (Fig. 7, A, D, 
and E), which were previously designated as 
markers for warm-sensitive neurons (8). Yet 
only one of these clusters, E-3:AvPe/Pe/VMPO/ 
VLPO, displayed the established spatial location
of warm-sensitive cells (Fig. 7E) (8). Indeed, upon
heat stress (29), a high level of the IEG cFos was
expressed in cells in the region covered by E-3, 
and Sncg, a marker gene of E-3’s corresponding 
scRNA-seq cluster e13 (table S9), was highly 
enriched in these cFos-positive cells (Fig. 7F). 
E-3 expressed the leptin and prolactin recep-
tors (Lepr and Prlr) (Fig. 7D), suggesting a
mechanism by which metabolic and reproduc- 
tive states may modulate thermoregulation. Pre- 
optic cells receiving projections from Arcuate 
Nucleus Kiss1 cells were recently implicated in 
the regulation of hormonally induced hot flashes 
via the activation of the receptor Tacr3 (58). E-3 
and e13 expressed both Kiss1 receptor and Tacr3, 
implicating the warm-sensitive cluster in the 
generation of hot flashes.


Neuronal cell types activated by key
social behaviors 

To investigate the role of specific neuronal pop- 
ulations in discrete social behaviors, we included 
cFos in MERFISH measurements and charac- 
terized animals after parenting, aggression, or 
mating. We performed clustering analysis of these 
behavioral samples together with naive samples 
not subjected to behavioral stimuli and did not 
observe any cluster that was present only in be- 
havioral samples. For each behavior, only a few 
neuronal clusters, each characterized by key mark- 
ers, exhibited a statistically significant enrichment 
in cFos-positive cells (Fig. 8, A and B, and fig. S20). 
In addition, many other clusters showed a small 
fraction of cFos-positive cells, which together 
accounted for a substantial subset of all cFos- 
positive cells (Fig. 8A and fig. S20B). Activated 
neurons in all tested behaviors were predomi- 
nately inhibitory. 
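
The notion of a cluster being "enriched in cFos-positive cells" can be illustrated with a small calculation. The sketch below is an assumption for illustration only: the text above does not state which statistical test was used, so a one-sided hypergeometric test is swapped in as a common choice for this kind of question. The function names and all cell counts are hypothetical.

```python
from math import comb


def enrichment_p_value(total_cells, total_positive, cluster_cells, cluster_positive):
    """One-sided hypergeometric test (an assumed stand-in, not the study's method).

    Returns P(X >= cluster_positive), where X is the number of cFos-positive
    cells expected in a cluster of size cluster_cells if positive cells were
    spread at random over all total_cells.
    """
    upper = min(cluster_cells, total_positive)
    tail = sum(
        comb(total_positive, k) * comb(total_cells - total_positive, cluster_cells - k)
        for k in range(cluster_positive, upper + 1)
    )
    return tail / comb(total_cells, cluster_cells)


def fold_enrichment(total_cells, total_positive, cluster_cells, cluster_positive):
    """Fraction of positive cells in the cluster divided by the overall fraction."""
    return (cluster_positive / cluster_cells) / (total_positive / total_cells)


# Hypothetical counts, not taken from the study:
# 1000 neurons, 50 of them cFos-positive, and a cluster of 100 neurons.
p_enriched = enrichment_p_value(1000, 50, 100, 20)  # 20 positives in the cluster
p_typical = enrichment_p_value(1000, 50, 100, 5)    # 5 positives, the chance level

print("fold enrichment:", fold_enrichment(1000, 50, 100, 20))
print("p (20 positives):", p_enriched)
print("p (5 positives):", p_typical)
```

With many clusters tested per behavior, a correction for multiple comparisons would normally be applied to such P values. That step is omitted here for brevity.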

We first examined clusters preferentially ac- 
tivated after pup exposure, which elicits par- 
enting behaviors in virgin females, mothers, and 
fathers but triggers aggression in virgin males 
(5, 59). We identified, with MERFISH, prefer- 
entially activated clusters—clusters enriched in 
cFos-positive cells (referred to as cFos enrich- 
ment hereafter) (Fig. 8, A and B, and fig. S20)— 
and validated, by use of two- or three-color in 
situ hybridization, all clusters that could be 
specifically defined by two marker genes and 
spatial location (Fig. 8C). In all animal groups 
that display parenting, I-14: MPA/MPN/StHy had 
substantially higher cFos enrichment than that 
of any other cluster (Fig. 8A and fig. S20). The 
expression of Gal and vasopressin receptor (Avpr1a)
within this cluster (Fig. 8B) is consistent with 
the established role of Gal neurons in parenting 
(6) and their recently documented vasopressiner- 
gic input (60). Moreover, this cluster expressed a 
large set of hormone and peptide receptors (Fig. 
8B), substantiating the complex neuromodulation
of parenting (61). In addition to I-14, we observed
a preferential activation of clusters by pup ex- 
posure in a state-dependent manner. In mothers 
and fathers, cFos enrichment was observed in 
Oxtr-expressing cluster I-10: MPA/PaAP/SHy, and 
mothers additionally showed modest cFos enrich- 
ment in I-27:BNST and E-1:AvPe/Pe/MnPO/VMPO 
(Fig. 8 and fig. S20). Fathers additionally showed 
cFos enrichment in clusters I-2:BNST/StHy/MPN 
and in I-16:AvPe/Pe/SHy (Fig. 8 and fig. S20). 

By contrast, I-14 was not preferentially acti- 
vated in virgin males exposed to pups, which is 
consistent with their aggressive responses to- 
ward pups (5). Instead, I-16:AvPe/Pe/SHy and 
I-24:BNST exhibited cFos enrichment after pup- 
directed aggression (Fig. 8 and figs. S20 and S21). 
I-16 was also preferentially activated in virgin 
males that display inter-male aggression, as was 
I-2 (Fig. 8 and figs. S20 and S21), suggesting that
I-16 is broadly involved in aggressive responses,
whereas I-24 and I-2 may mediate differential re- 
sponses to pups and adults. I-16 was also acti- 
vated in fathers during parenting, which might 
indicate that the switch from pup-directed ag-
gression to parenting (59, 61) occurs in circuit
nodes downstream of I-16, or that the role of
I-16 in aggression is inhibited by pro-parenting 
circuits. The Gal- and Th-enriched cluster I-16 
expressed Vmat2 (Slc18a2) and showed corre-
spondence to the scRNA-seq cluster i16:Gal/Th
(table S9), which additionally expressed Ddc, sug- 
gesting that this cell population is dopaminer- 
gic. The activation of a dopaminergic neuronal 
population during aggression may provide a 
cellular basis to understand the observations 
that dopamine is released during aggression 
and modulates aggressive behavior (6). I-16 also 
expressed the opioid receptors Oprd1 and Oprk1 
(Fig. 8B), supporting the effects of opioid receptor 
ligands on aggressive encounters in mice and 
other rodents (62). 

Next, we examined clusters activated by suc- 
cessful mating (29) to capture neural activity 
associated with appetitive and consummatory 
aspects of sexual behavior. The cluster I-15: 
AvPe/Pe/VMPO, which displayed a cell abun- 
dance enrichment in female mice compared with 
males (Fig. 6H), was preferentially activated 
in females and to a lesser extent in males after 
mating (Fig. 8 and figs. S20 and S21). Both I-14 
and I-15—activated by parenting and mating, 
respectively—expressed Esr1 (Fig. 8B), which is 
consistent with recent findings on the involve-
ment of Esr1-expressing cells in parenting and
mating behaviors (63, 64). However, our data 
showed that cells activated by parenting and 
mating belong to two distinct cell populations 
localized to distinct preoptic nuclei. More gen- 
erally, we found Esr1 expression in nearly all be- 
haviorally activated clusters, suggesting that this 
gene alone cannot define specific behaviorally 
relevant cell types. In addition to I-15, which was 
activated in both sexes after mating, we also ob- 
served a few clusters that exhibited sexually di- 
morphic cFos enrichment, such as I-16 in female 
mating and I-2, I-11, I-14, I-33, E-8, and E-15 in 
male mating (Fig. 8 and figs. S20 and S21). The
weak activation of the parenting cluster I-14 after 
male sexual behavior is consistent with our pre- 
vious finding that a small subset of Gal neurons 
are activated in both mating and parenting (5) 
and may suggest a mechanism underlying the 
mating-dependent switch to parental behavior in 
virgin male mice. Intriguingly, the Th-enriched 
cluster I-16 was activated by different behaviors 
in animals of different sexes, mating in females 
and aggression in males, similarly to the function- 
al sexual dimorphism observed in a recent study 
of AvPe Th cells (6). 


Discussion 


Here, we combined the power of scRNA-seq and 
MERFISH to create a spatially resolved and func- 
tionally aware cell atlas of the preoptic region of 
the mouse hypothalamus. These methods iden- 
tified major cell classes and neuronal subpopu- 
lations with correlated gene expression profiles, 
providing cross-validations for both methods. 
Moreover, the two methods are complementary: 
scRNA-seq measured more genes than MERFISH 
and helped define marker genes for MERFISH, 
whereas MERFISH provided spatial context of
cells at high resolution as well as more accu-
rate detection and quantification of weakly ex-
pressed genes, including functionally important 
genes such as neuropeptide and hormone recep- 
tors. As a result, the combined data provided 
a more complete picture of the transcriptional 
diversity and spatial organization of individual 
cells in the preoptic region. 

We observed a remarkable diversity of neurons 
in this region, comprising ~70 different neuronal
clusters. Transcription factors and cell-surface 
markers have been observed as markers of neu- 
ronal identity in the mouse spinal cord (65) and 
cortex (66) and the Drosophila olfactory system 
(67). In the mouse preoptic region, genes dis- 
criminating neuronal clusters were enriched 
for neuropeptides and molecules involved in 
neuromodulator synthesis and transport and 
for transcription factors. By contrast, neuromod- 
ulator receptors were weaker discriminators of 
these neuronal populations or were expressed 
at low levels, providing a useful note of caution in 
using these genes for targeted functional studies 
in the preoptic region. Many of these neuronal 
populations were defined by a combination of 
multiple genes, indicating that genetic intersec- 
tional approaches will be most useful in the func- 
tional interrogation of specific cell types (67). 

The list of cell populations identified in this 
work substantially expands and further defines 
previously reported cell types in this region. We 
observed specific clusters enriched in genes pre- 
viously identified as markers of functionally im- 
portant preoptic cells (for example, cells involved 
in parenting, aggression, thermoregulation, sleep, 
or thirst), such as Galanin, Th, Adcyap1, Nts, Crh, 
Tac1, Cck, Agtr1a, Nos1, and aromatase (tables S5
and S9). Our data also resolved many of the pre- 
viously described singular cell types into multiple 
cell populations. We observed good correlation 
between our data and recent scRNA-seq analy- 
ses of the whole mouse hypothalamus (68) and 
of the whole mouse brain (69). In the latter 
study, which we compared with ours in more 
detail because it had a larger whole hypothalamic 
dataset, 15 neuronal clusters were identified from 
~2000 profiled hypothalamic neurons (fig. S22)
(69). However, because we used ~10 times more 
neuronal scRNA-seq profiles to characterize about 
one-fifth of the whole hypothalamus (the pre- 
optic region), we were able to analyze the pre- 
optic region with a greater depth and thus gain 
finer delineation of cell populations (fig. S22B). 
The neuronal populations that we uncovered in 
the preoptic region also largely differed from 
cell clusters described in scRNA-seq studies of 
other hypothalamic areas (33, 70), perhaps sug- 
gesting a molecular and cellular distinctiveness 
of this brain area. 

MERFISH further allowed us to map the spa- 
tial organization of cell populations. Structural 
features of the hypothalamus are not as visibly 
apparent as in laminated parts of the brain, and 
hypothalamic nuclei have largely been defined 
by subtle differences in neuronal density together 
with specific connectivity and functional roles 
(1). However, the differences and similarities
in the cell-type composition and function of dis-
tinct nuclei in the hypothalamus remain unclear
(71). MERFISH allowed us to examine the or- 
ganization of distinct cell populations within 
individual hypothalamic nuclei, providing a 
framework with which to explore the molec- 
ular basis of their anatomical segregation. The 
spatial organizations of neuronal clusters were 
diverse: Many of the neuronal clusters were each 
primarily enriched in one or a few nuclei, whereas 
several clusters were substantially more dispersed. 
Moreover, individual nuclei were composed of 
multiple neuronal clusters. We also observed spe- 
cific topographical organizations that can sup- 
port defined modes of function; for example, the 
spatial proximity of aromatase- and Esr1-expressing
cells may support paracrine estrogen signaling.
Although aromatase- and Esr1-enriched cell pop-
ulations regulate sex hormone production and
signaling, hardly any of them appeared to be
exclusive to either sex, which is con-
sistent with behavioral evidence that males and
females are capable of exhibiting behaviors typ- 
ical of the opposite sex (5, 72). 

Last, the ability of MERFISH to interrogate in- 
tact tissue allowed us to include activity-dependent 
IEGs in our measurements, allowing the iden- 
tification of neuronal populations activated by 
specific behaviors. Using this approach to study 
several social behaviors—including parenting, 
aggression, and mating—we observed that only 
a small number of neuronal clusters displayed 
statistically significant enrichment in cFos- 
positive cells after each behavior. This observation 
supports a model in which genetically encoded 
circuits composed of transcriptionally distinct 
neuronal types control specific hypothalamic 
functions. However, in all three behaviors, we 
also observed widespread activation of many 
neuronal clusters at a substantially weaker level, 
suggesting a secondary role for many different 
neuronal types in these behaviors and possibly 
reflecting necessary cross-talk between different 
behavior circuits. We caution that the large range 
of cFos expression levels seen in our samples sug- 
gests that some activated clusters with low levels 
of activity-dependent cFos induction may not have 
been identified if their cFos levels were below 
the background noise in our measurements. 

This study also extended our previous work on 
circuits that underlie parenting behavior (5, 60) 
by resolving preoptic Gal neurons into several 
distinct subpopulations, with only one of them 
involved in both male and female parenting. In 
addition, we identified distinct cell populations 
that were differentially activated in mothers and 
fathers during parenting, providing insights into 
how physiological state may affect parental be- 
havior. Moreover, we identified cell populations 
associated with sexual behavior in males and fe- 
males as well as those involved in male aggression 
toward infants and conspecific males. Together, 
our data defined functionally relevant cell popu- 
lations that underlie social behavior with a high 
molecular and spatial resolution. 

Overall, our study demonstrates the power of 
combining scRNA-seq and MERFISH to map cell
types and their organization in the brain, reveal
their functional roles in diverse behaviors, and 
generate hypotheses about structure-function 
relationships in neural circuits. The identifica- 
tion of marker gene combinations and spatial 
locations defining the neuronal populations in 
the preoptic region provides necessary tools for 
the precise targeting and perturbation of these 
neurons, thus enabling future functional studies. 
Because MERFISH is an imaging-based approach,
we envision that it can be combined with diverse
imaging methods for anatomical tracing and func-
tional interrogation to provide insights into how
distinct cell types communicate to form func- 
tional circuits in the healthy and diseased brain, 
as well as in other tissues. 


Methods summary 


scRNA-seq of the preoptic region was performed 
by using protocols modified from (70) to increase 
neuronal survival. Tissue fixation and sectioning 
as well as MERFISH probe construction, staining, 
and imaging were performed by using estab- 
lished protocols (26). We imaged 155 genes in 
MERFISH measurements, with 135 genes im- 
aged by using combinatorial smFISH measure- 
ments and 20 additional genes imaged by using 
sequential rounds of noncombinatorial FISH. The 
sequences of all probes used for MERFISH are 
provided in tables S10 and S11. Individual cells 
were segmented with a seeded watershed algo- 
rithm by using DAPI and total mRNA costains 
(29). Cell clusters were identified by using Louvain 
community detection on a nearest-neighbor graph 
built on the statistically significant principal com-
ponents of gene expression (28, 31, 73), modi-
fied to allow an optimized choice of the number
of nearest neighbors in the graph. Behavioral 
stimuli were applied to animals by using estab- 
lished protocols (5, 72, 74), and only animals scored 
as displaying the desired behavior were used 
for MERFISH measurements. 
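
The graph-based clustering step summarized above can be sketched in simplified form. The example below is illustrative only and rests on stated assumptions: it builds a k-nearest-neighbor graph from toy two-dimensional points (standing in for cells projected onto principal components) and scores two candidate partitions by modularity, the quantity that Louvain community detection seeks to maximize. It does not implement the Louvain search itself or the optimized choice of neighbor number described in the text, and the function names `knn_graph` and `modularity` are hypothetical.

```python
from math import dist


def knn_graph(points, k):
    """Undirected edge set linking each point to its k nearest neighbors."""
    edges = set()
    for i, p in enumerate(points):
        others = sorted(
            (j for j in range(len(points)) if j != i),
            key=lambda j: dist(p, points[j]),
        )
        for j in others[:k]:
            edges.add((min(i, j), max(i, j)))
    return edges


def modularity(edges, labels):
    """Modularity Q of a partition of an unweighted, undirected graph.

    Q = sum over communities c of (L_c / m) - (d_c / (2 m))^2, where m is the
    number of edges, L_c the number of edges inside c, and d_c the summed
    degree of the nodes in c.
    """
    m = len(edges)
    internal, degree_sum = {}, {}
    for a, b in edges:
        for node in (a, b):
            degree_sum[labels[node]] = degree_sum.get(labels[node], 0) + 1
        if labels[a] == labels[b]:
            internal[labels[a]] = internal.get(labels[a], 0) + 1
    return sum(
        internal.get(c, 0) / m - (degree_sum[c] / (2 * m)) ** 2 for c in degree_sum
    )


# Toy data (hypothetical): two well-separated groups of four "cells" each.
cells = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
edges = knn_graph(cells, k=2)

q_separated = modularity(edges, [0, 0, 0, 0, 1, 1, 1, 1])  # matches the groups
q_mixed = modularity(edges, [0, 1, 0, 1, 0, 1, 0, 1])      # ignores the groups

print("edges:", len(edges))
print("Q, groups kept apart:", q_separated)
print("Q, groups mixed:", q_mixed)
```

The partition that follows the two groups scores a higher modularity than the mixed one, which is the signal a Louvain-type search would exploit when merging cells into communities.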


INTRODUCTION: Scientists are building de-
tailed maps of the cellular composition in the
human brain to learn about its development. 
In the human cortex, the largest area of the 
mammalian brain, neural circuits are formed 
through anatomical refinement, including axon 
and synaptic pruning, and the emergence of 
complex patterns of network activity during 
early fetal development. Cellular analyses in 
the human brain are restricted to postmortem
material, which cannot reveal the process of
development. Model organisms are, therefore, 
commonly used for studies of brain physiol- 
ogy, development, and pathogenesis, but the 
results from model organisms do not always 
translate to humans. 


Human neuron dynamics imaged in vivo. We combined a human-specific genetic
background with live imaging in cortical tissue grafts to investigate the earliest stages of
human axon, synaptic, and network activity development and model Down syndrome.


RATIONALE: Systems to model human neu-
ron dynamics and their dysfunction in vivo are
needed. While biopsy specimens and the gen-
eration of neurons from induced pluripotent
stem cells (iPSCs) could provide the neces- 
sary human genetic background, two- and 
three-dimensional cultures lack factors that 
normally support neuronal development, in- 
cluding blood vessels, immune cells, and in- 
teraction with innervating neurons from other 
brain areas. On the basis of previous stem cell
transplantation studies in mice, we reasoned that
the physiological microenvironment of the adult
mouse brain could support the growth of human
cortical tissue grafts that had been generated
from iPSC-derived neuronal progenitors. With
human neurons implanted into the mouse brain,
high-resolution, real-time in vivo monitoring of
human neuron dynamics for periods of time
spanning the range from subseconds to several
months becomes feasible.

Read the full article at http://dx.doi.org/10.1126/science.aau1810


RESULTS: We found that transplanted human 
iPSC-derived neuronal progenitors consist- 
ently assembled into vascularized territories 
with complex cytoarchitecture, mimicking key 
features of the human fetal cortex, such as its 
large size and cell diversification. Single-cell- 
resolution intravital microscopy showed that 
human neuronal arbors were refined via branch- 
specific retraction, rather than degeneration. 
Human synaptic networks restructured over 
the course of 4 months, while maintaining bal- 
anced rates of synapse formation and elimina- 
tion. Human functional neurons rapidly and 
consistently acquired oscillatory population 
activity, which persisted over the 5-month ob- 
servation period. Lastly, we used cortical tis- 
sue grafts derived from the fibroblasts of two 
individuals with Down syndrome, caused by 
supernumerary chromosome 21. We found 
that neuronal synapses in cells derived from 
these individuals were overly stable and that 
oscillatory neural activity was reduced in these 
grafts, revealing in vivo cellular phenotypes not 
otherwise apparent. 


CONCLUSION: By combining live imaging in 
a multistructured tissue environment in mice 
with a human-specific genetic background, 
we provide insights into the earliest stages of 
human axon, synaptic, and network activity 
development and uncover cellular phenotypes 
in Down syndrome. Our work provides an al- 
ternative experimental system that can be used 
to study other disorders affecting the develop- 
ing human cortex. 


The list of author affiliations is available in the full article online. 
*These authors contributed equally to this work.


In vivo modeling of human neuron
dynamics and Down syndrome 


Raquel Real*, Manuel Peter*, Antonio Trabalza*, Shabana Khan,
Mark A. Smith, Joana Dopp, Samuel J. Barnes, Ayiba Momoh, Alessio Strano,
Emanuela Volpi, Graham Knott, Frederick J. Livesey†, Vincenzo De Paola†


Harnessing the potential of human stem cells for modeling the physiology and diseases of 
cortical circuitry requires monitoring cellular dynamics in vivo. We show that human 
induced pluripotent stem cell (iPSC)-derived cortical neurons transplanted into the adult
mouse cortex consistently organized into large (up to ~100 mm³) vascularized neuron-
glia territories with complex cytoarchitecture. Longitudinal imaging of >4000 grafted
developing human neurons revealed that neuronal arbors refined via branch-specific 
retraction; human synaptic networks substantially restructured over 4 months, with 
balanced rates of synapse formation and elimination; and oscillatory population activity 
mirrored the patterns of fetal neural networks. Lastly, we found increased synaptic stability 
and reduced oscillations in transplants from two individuals with Down syndrome, 
demonstrating the potential of in vivo imaging in human tissue grafts for patient-specific 
modeling of cortical development, physiology, and pathogenesis. 


Cellular analyses in the human brain are
restricted mainly to postmortem material,
which cannot provide direct observation
of dynamic events, such as anatomical re-
finement (1) and the emergence of com-
plex patterns of network activity. This limitation
raises the question of how to model human neu- 
ron dynamics and their dysfunction in the many 
incurable disorders that affect the developing 
cortex (2). 

Rodent models have been valuable for un- 
derstanding the pathophysiology of complex ge- 
netic disorders, such as Down syndrome (DS) 
(3-5), which is associated with neurodevelop- 
mental alterations and is caused by trisomy 
of chromosome 21 (Ts21), but certain pheno- 
types are better captured in the context of a 
human genetic background (6). 

Human induced pluripotent stem cell (iPSC)- 
derived neurons can be used in patient-specific 
studies to model human cortical development 
(7), but in vitro two-dimensional (2D) and 3D 
cultures (8, 9) lack key interactions with neuro-
glia and vasculature (10). Therefore, systems that
more closely recapitulate the complex cellular 
dynamics of the living brain by using patient- 
specific cells are urgently needed. 

Building on previous transplantation work 
(11), we hypothesized that the existing physio- 
logical microenvironment in the adult mouse 
brain could support the expansion of human 
cortical tissue grafts from iPSC-derived neu- 
rons, thus allowing high-resolution, real-time 
in vivo monitoring of human neuron dynam- 
ics for extended periods of time. 

In this study, we used single-cell-resolution 
intravital microscopy (12) in human tissue grafts 
to gain insights into the dynamics of pruning, 
synaptogenesis, and network activity during the 
earliest stages of cortical neuron development 
and demonstrated this approach by modeling 
human neuron structural and functional dynam- 
ics in DS. This research was approved by the 
U.K. Stem Cell Bank Steering Committee and 
the U.K. Home Office, in accordance with the 
U.K. Code of Practice for the Use of Human 
Stem Cell Lines and the U.K. Animals (Scientific 
Procedures) Act 1986, respectively. 


Complex cytoarchitecture in human 
cortical tissue grafts 


To study the dynamics of human axon and syn- 
aptic development and population activity in 
vivo, we generated cortical excitatory neurons 
from a control human iPSC line (13) (fig. S1) and 
transplanted them into the adult mouse soma- 
tosensory cortex (SCx1) for chronic multiphoton 
imaging (Fig. 1A). Cells were transplanted after 
36 to 38 days of differentiation, a stage at which 
cultures contained ~50% neural progenitor cells 
and ~50% deep-layer cortical neurons [of which
~15% expressed T-box, brain 1 (TBR1+), and
~85% expressed COUP transcription factor- 
interacting protein 2 (CTIP2+)] (fig. S2, A and 
B). As expected, and consistent with ongoing 
neurogenesis after engraftment, upper-layer cor- 
tical excitatory neurons and a small proportion 
of astrocytes and oligodendrocytes could also be 
found at both 3 and 5 months posttransplanta-
tion (mpt) (fig. S2, C and D). Electron microscopy
(EM) confirmed that human grafts resembled im- 
mature cortical tissue at 130 days posttransplan- 
tation (dpt) (fig. S3, A to C), with few synapses 
and few myelinated axons, and showed no de- 
tectable boundary with the mouse brain (fig. S3C), 
suggestive of structural integration (14). The grafts
contained proliferating cells (fig. S3, C and D), 
enlarged with time (movie S1), and consisted of 
multiple human- and host-derived cell types (figs. 
S2 and S3). The cell types from the host included 
microglial cells, oligodendrocytes, astrocytes, 
and both excitatory neurons and inhibitory in- 
terneurons (fig. S3, D to F), whereas no inter- 
neurons of human origin were found (n = 3 
transplants). Microglia recruitment in the graft 
was minimal (fig. S4). Postmortem analysis re- 
vealed that the human tissue grafts developed 
organizational features resembling the structural 
arrangement of the early fetal cortex (fig. S5) 
(15, 16). 

At earlier stages (<2 mpt), cortical tissue grafts 
contained areas with ventricular zone-like terri- 
tories, with cells positive for Paired box protein 6 
(PAX6), a marker of neuronal progenitors, and 
Nestin, a marker for radial glia, which extended 
processes both radially outward from the core of 
the rosette-like structures (fig. S5A) and arranged 
in parallel (fig. S5B), mimicking the organization 
of radial fibers in the intermediate zone of the 
human fetal cortex (15). Ki67-expressing pro- 
liferating cells were found in the inner apical 
layer, with doublecortin (DCX)-positive immature
neurons toward the basal part, extending out 
into the rest of the graft (16) (fig. S5A). After 
2 mpt, the rosettes did not persist, and although 
discrete cortical laminae were not clearly visible, 
consistent with their formation in late embry- 
onic development (~7 months postconception) 
(7), immunostaining for deep- and upper-layer 
cortical neurons with antibodies for TBR1 and 
Special AT-rich sequence-binding protein 2 
(SATB2), respectively, showed that these cell 
populations can segregate in vivo (fig. S5C). Hu-
man astrocytes were homogeneously distributed
in the cortical tissue grafts (fig. S5D). Lastly, 
human tissue grafts were vascularized, as shown 
in vivo and by the endothelial marker cluster of 
differentiation 31 (CD31) (Fig. 1, B to D), suggest- 
ing that the adult mouse brain microenvironment 
can support the development of a multicellular 
transplant. 


Human axon pruning imaged in vivo 


To track human neurons in vivo, we engineered 
them to express green fluorescent protein (GFP) 
via lentivirus-mediated transduction before trans- 
plantation. Human neurons were present for 
the duration of our experimental time course,
which spanned up to 6 months, and spread away
from the injection site (Fig. 1B) [on average, up
to 1.2 ± 0.6 mm (mean ± SD) from the bregma
in the rostral direction over the first 3 mpt (n =
4 mice)]. Consistent with the immature brain
cell-cell interactions (10), human axons grew
along blood vessels and as fiber bundles (Fig. 1E
and movie S2), and parallel and radially oriented
axonal layers could be detected below the dura
mater (Fig. 1F and movie S3), similar to the ones
found in the human cortex (8).

Fig. 1. Single-cell-resolution in vivo imaging of human cortical tissue
grafts reveals mechanisms of pruning. (A) Schematic of experimental
design (left) and two-photon in vivo imaging time line (right). NeurRef,
neurite refinement; CaDyn, calcium dynamics; SynDyn, synaptic dynamics.
(B) Representative two-photon overview of the cranial window over the
injection site at 3 mpt. (C) Bright-field view of a cranial window (~15 mm²) at
5 mpt. Arrowheads indicate blood vessels. (D) Representative immuno-
staining of endothelial marker CD31 in the human graft at 5 mpt. Arrowheads
indicate blood vessels. hNu, human nucleus marker. (E) Representative
example of axonal bundles (arrows) along blood vessels. Dashed red lines
represent a blood vessel. (F) Representative example of axonal layering in
human grafts. The example shown is the same as that in movie S3. (G) Example
of a human neuron migrating (*) and remodeling the leading processes
(arrows) over 7 hours. (H) Representative example of extensive remodeling
of a dendritic arbor in a human pyramidal neuron over 25 hours. (I) Pruning
of axonal branch over 6 hours. Dashed red lines represent a blood vessel.
(I’) Neurite degeneration over 22 hours. Arrows indicate axonal fragments.
(J) Representative examples of axon elongation and retraction over 24 hours.
The boxed area in the right panel is magnified in the inset. The arrows in
the inset indicate EPBs. gc, growth cone. (K) Speed of neurite elongation and
retraction at 3 mpt (n = 113 neurites from 104 cells in six animals, average
17 cells per animal). Means and SEM are indicated. Mann-Whitney U test,
***P < 0.001. (L) Proportion of neurites elongating, retracting, and stable in
24-hour intervals at 3 mpt (n = 92 neurites from 88 cells in six animals,
average 15 cells per animal). Error bars indicate SEM. Bonferroni's multiple
comparisons test after one-way analysis of variance (ANOVA), F2,15 = 43.74,
P < 0.0001; *P < 0.05; ****P < 0.0001. Scale bars, 500 μm (B), 100 μm (D),
50 μm [(E) and (F)], 20 μm [(G), (H), and (J)], 10 μm (I), and 2 μm (I’).

Given the widespread axonal extension out- 
side the graft area, we asked which brain regions 
human neurons target 5 mpt. Main SCx1 target 
areas showed a higher number of human fibers 
than in areas known to receive fewer projections 
from SCx1 (fig. S6), suggesting that the direction 
of axon elongation is targeted. For example, the 
ipsilateral motor cortex, striatum, thalamus, and 
contralateral SCx1 received more fibers than the 
cerebellum and substantia nigra, and the corpus 
callosum had more axonal tracts than the in- 
ternal capsule and cerebral peduncle (fig. S6), 
as expected from rodent tracing experiments 
(19). These data provide evidence for long-range 
(over centimeters) axon growth of grafted human 
neurons through the mouse adult brain and 
indicate that, although human axons are either 
not responsive to or can overcome the inhib- 
itory signals present in the adult mouse brain, 
they may be directed by existing guidance cues 
or paths. 

After an initial phase of growth (20), the 
selective pruning of axons and dendrites is 
thought to occur normally via retraction and 
degeneration during early development (2, 21). 
We explored the mechanisms of human neurite 
pruning up to 3 mpt (Fig. 1, G to L, and fig. S7, 
A and B). At this stage, neurons were still migrat- 
ing (Fig. 1G) and developing neural processes in 
a highly dynamic mode (Fig. 1, G to L). We 
tracked the fate of 92 human neurites from 88 
cells in six mice at 3 mpt (Fig. 1, G to L). Whereas 
most neurites (58.4% ± 5.5%) elongated in 
24 hours, neurite refinement was dynamic, and 
interchanging retraction and elongation of individual 
neurites (31.0% ± 2.1%) over 24 hours 
were observed (Fig. 1, I to L). Developmental 
neurite degeneration involves cytoskeletal de- 
struction with widespread fragmentation over 
a time scale of 12 to 48 hours (22), whereas re- 
tracting axons do not leave fluorescent fragments 
behind (23). Reducing the imaging interval from 
24 hours to 8 hours showed that branch pruning 
(Fig. 1I) occurred mainly by retraction (91%), 
rather than degeneration (Fig. 1I’) (9%). Axonal 
en passant boutons (EPBs), one of the two types 
of presynaptic specialization on cortical axons 
(24), could be observed in branches with a 
growth cone elongating (Fig. 1J). Neural processes 
extended long distances (maximum neurite 
extension = 462.769 µm in 24 hours) at a 
speed of 10.29 ± 0.73 µm/hour (Fig. 1K), comparable 
to that observed in the neonatal mouse 
brain (23). Results were validated with tissue 
grafts from an independent control line (fig. S7, 
A and B). 

Human synaptic development 
imaged in vivo 

Next, we studied the dynamics of synaptogen- 
esis up to 4 mpt. Hallmarks of developing syn- 
aptic networks are an increase in synaptic density 
over time, followed by pruning, and the acquisi- 
tion of a steady state with balanced rates of syn- 
aptic gain and loss (25). However, when and how 
human synaptic networks acquire these proper- 
ties is unclear. We first considered dendritic 
spine formation and elimination (Fig. 2, A to F). 

After the initial phase of cell migration and 
neurite remodeling (Fig. 1, G and H), neurons 
stabilized, allowing us to track the same cells 
over time (Fig. 2A and fig. S8). Dendritic spines, 
the structural correlates of mammalian excitatory 
synapses (26), were seen as early as 20 dpt (32.8 ± 
5.5 dpt for either dendritic filopodia, considered 
to be the precursors of dendritic spines, or 
spines; n = 3 mice) (27, 28). We monitored >500 
dendritic segments from six mice over days. 
However, for most dendrites, the density of 
synapses was too low to quantitatively study 
the dynamics of dendritic spines before 3 mpt, 
as expected from previous human fetal cerebral 
cortex postmortem work (29) and the early de- 
velopmental stage modeled in this study. Eight 
neurons had sufficient dendritic spine numbers 
at 3 mpt to calculate spine density and turnover 
during three to four consecutive sessions of 
48-hour intervals (up to 6 days). The average 
spine density was similar to that in the human 
early fetal cerebral cortex (29) and constant over 
the imaging period (Fig. 2C) (0.043 ± 0.006 
spines/µm; n = 70 spines present in the first 
session, 176 in total; Kruskal-Wallis test, P > 
0.05). Synaptic structures were added and elim- 
inated at equal rates, even at these early develop- 
mental stages (Fig. 2D) (Wilcoxon matched-pairs 
signed-rank test, P > 0.05). The turnover ratio 
(TOR), a function of both spine gain and loss 
(30), was 46.9% ± 5.3% over 4 days (Fig. 2E), 
indicating synaptic reorganization. 
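
As an illustration of how a turnover ratio can be derived from tracked spines, the following minimal Python sketch assumes one commonly used definition, TOR = (gained + lost) / (N_first + N_second). The text only states that the TOR is a function of spine gain and loss (30), so this formula, the function name `turnover_ratio`, and the spine labels are assumptions made for illustration, not the study's analysis code.

```python
def turnover_ratio(first_session, second_session):
    """Turnover ratio between two imaging sessions.

    Each session is a set of spine identifiers (hypothetical labels
    assigned while tracking the same dendrite over time).
    Assumed definition: (gained + lost) / (N_first + N_second).
    """
    gained = len(second_session - first_session)  # present only in the later session
    lost = len(first_session - second_session)    # present only in the earlier session
    total = len(first_session) + len(second_session)
    if total == 0:
        raise ValueError("no spines in either session")
    return (gained + lost) / total


# Toy example: four spines on day 0, two of them replaced by day 4.
day0 = {"s1", "s2", "s3", "s4"}
day4 = {"s1", "s2", "s5", "s6"}
tor = turnover_ratio(day0, day4)
print(f"TOR over 4 days: {tor:.1%}")
```

With balanced gain and loss, as in this toy example, spine density stays constant even though the turnover ratio is high, which is the situation described for the grafts at 3 mpt.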

To investigate the development of synaptic 
remodeling over time, we repeated the same 
experiment after 1 month. Again, spine density 
was constant over time (Fig. 2C) (0.112 ± 0.024 
spines/µm; n = 171 spines present in the first 
session, 291 in total; Kruskal-Wallis test, P > 
0.05). However, the average spine density was 
increased at 4 mpt. The majority of dendrites 
had balanced rates of dendritic spine gain and 
loss (Fig. 2D) (paired two-tailed t test, P > 0.05), 
and only in one cell were we able to capture net 
synaptic pruning over 2 days (Fig. 2C, thick 
dashed line), consistent with the idea that a 
major phase of synaptic pruning occurs only at 
later developmental stages (28). 

The TOR over 4 days was 27.6% ± 3.7%, which 
was lower than at 3 mpt (Fig. 2E). Consistently, 
the survival fraction, defined as the fraction of 
spines surviving as a function of time, was higher 
at 4 mpt (Fig. 2F), suggesting stabilization of 
dendritic spine dynamics over time. 
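
The survival fraction defined above can be sketched in a few lines of Python. This is a minimal illustration, assuming that a spine counts as surviving at a given session only if it was present in the first session and in every session since; the function name `survival_fraction` and the spine labels are hypothetical.

```python
def survival_fraction(sessions):
    """Fraction of first-session spines still present at each session.

    `sessions` is a list of sets of spine identifiers, one set per imaging
    session, in chronological order. Assumption: a spine that disappears
    is not counted again even if a spine reappears at the same position.
    """
    if not sessions or not sessions[0]:
        raise ValueError("the first session must contain at least one spine")
    cohort = set(sessions[0])      # spines followed from the first session
    n_initial = len(cohort)
    fractions = []
    for present in sessions:
        cohort &= present          # keep only spines that are still present
        fractions.append(len(cohort) / n_initial)
    return fractions


# Toy example: three sessions at 48-hour intervals.
sessions = [
    {"a", "b", "c", "d"},
    {"a", "b", "c", "e"},
    {"a", "b", "f"},
]
survival = survival_fraction(sessions)
print(survival)
```

A curve that decays more slowly, as reported at 4 mpt relative to 3 mpt, indicates that a larger share of the initial spine population persists.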

To more thoroughly assess synaptic dynamics, 
we also studied presynaptic terminals along 
human cortical axons (Fig. 2, G to L). The density 
of boutons remained stable over time (Fig. 2I) 
(0.051 ± 0.0075 EPBs/µm; n = 69 EPBs in the 
first session, 145 in total), indicating that axo- 
nal boutons were also added and eliminated at 
equal rates (Fig. 2L). The TOR over 4 days was 
45.1% ± 3.6% (Fig. 2, J and K), denoting comparable 
dynamics between dendritic spines and 
axonal boutons (at 3 mpt, Mann-Whitney U test, 
P = 0.34). 

In summary, we were able to study early 
events of human cortical neuron synaptogenesis 
over the first 4 mpt. Despite the low synaptic 
density, consistent with the primordial stage 
modeled in this study (29), we can draw a num- 
ber of conclusions about early in vivo human 
synaptic network development. First, trans- 
planted human neurons initially formed synap- 
tic structures within 4 to 12 weeks of in vivo 
development, similar to the human fetal cere- 
bral cortex (29). Second, they underwent synaptic 
reorganization. Third, they progressively increased 
dendritic spine density over 1 month. Finally, 
human neurons balanced the rates of synaptic 
gain and loss over a time scale of a few days. 


Functional human cortical networks 
imaged in vivo 

Patterned neural activity is thought to be fun- 
damental to neural circuit development in the 
immature brain (31, 32). Although spontaneous 
and sparse activity can be detected in human 
cortical network preparations in vitro, recapit- 
ulating patterns typical of early human cortical 
population activity, such as recurrent oscillatory 
bursts (32), remains challenging (33, 34). 

We first investigated the electrophysiological 
properties of transplanted cells. We performed 
ex vivo whole-cell recordings in coronal brain 
slices containing the grafts (fig. S9). Current- 
clamp recordings were made from 18 pyrami- 
dal neurons (n = 4 mice), as identified by using 
differential interference contrast microscopy 
and expression of either GFP or tdTomato and 
by filling neurons with Lucifer yellow dye be- 
fore post hoc anatomical inspection (fig. S9A). 
Patched grafted pyramidal neurons were at different 
stages of biophysical maturation and development, 
with an average resting membrane 
potential of −53.8 ± 1.7 mV, average capacitance 
of 19.4 ± 2.2 pF, and average input resistance 
of 1.4 ± 0.1 gigaohms. Although cells were quiescent 
at resting membrane potentials, depolarizing 
current steps evoked action potential firing 
in all pyramidal neurons tested (fig. S9B), with 
average action potential amplitudes of 91.3 ± 
2.6 mV and half-widths of 2.2 ± 0.2 ms. 

Immunohistochemistry showed glutamater- 
gic and GABAergic terminals within the human 
graft (fig. S10, A and B). To confirm that human 
neurons received both excitatory and inhibitory 
input, pyramidal neurons were voltage clamped 
(−70 mV) and spontaneous miniature excitatory 
postsynaptic currents (mEPSCs) were observed 
at a frequency of 0.30 ± 0.05 Hz (5 of 18 neurons) 
with an amplitude of 20.1 ± 3.2 pA, which were 
completely blocked by the α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic 
acid receptor 

Fig. 2. Developing human synaptic networks are characterized by 
substantial restructuring and balanced rates of gains and losses. 

(A) Overview of cranial window at 136 and 138 dpt. Red arrows point to 
examples of cells with a stable location over a 48-hour period. (B) Detail of a 
representative dendrite imaged over 24 hours (white box in the top panel 
and red box in fig. S8A). Green, red, and white arrowheads indicate gained, 
lost, and stable dendritic spines, respectively. (C) Dendritic spine density over 
4 to 6 days at 3 mpt (n = 8 cells, 1.40 mm of total dendritic length, from 
three animals) and 4 mpt (n = 6 cells, 0.93 mm of total dendritic length, 
from two animals). Two-way ANOVA, interaction F(3,46) = 0.4357, P = 0.73. 
****P < 0.0001. (D) Average fractions of dendritic spines gained and lost over 
48 hours at 3 mpt (red, n = 8 cells) and 4 mpt (blue, n = 6). Two-way 
ANOVA, interaction F(1,24) = 0.1894, P = 0.67. Sidak’s multiple comparisons 
test, *P < 0.05 (gains); P = 0.063 (losses). ns, not significant. (E) Dendritic 
spine TOR over 4 days at 3 mpt (n = 8 cells) and 4 mpt (n = 6 cells). 
Mann-Whitney U test, *P < 0.05. Each data point represents a cell. 
(F) Dendritic spine survival fraction at 3 mpt (red, n = 7 cells) and 4 mpt 
(blue, n = 6 cells). Two-way ANOVA, interaction F(3,47) = 1.513, P = 0.22; 
*P < 0.05. (G) Representative example of a branched human axon at 130 dpt. 
The arrow indicates a growth cone. The boxed area is magnified in 
subsequent panels. (H) Detail of the axon shown in the boxed area in (G), 
imaged every 48 hours over 4 days. Green, red, and white arrowheads 
indicate gained, lost, and stable EPBs, respectively. (I) EPB density over 2 to 
4 days at 3 mpt (n = 8 cells, 1.3 mm of total axonal length, from three 
animals). One-way ANOVA, F(2,37) = 0.4014; P = 0.68. (J) Quantification of 
EPB TOR over 4 days at 3 mpt (n = 4 cells). Each data point represents an 
axon. (K) Quantification of EPB survival fraction at 3 mpt (n = 8 cells). 
(L) Average fractions of EPB gains and losses over 48 hours at 3 mpt 
(n = 8 cells). Wilcoxon matched-pairs signed-rank test; ns, not significant. 
[(C), (D), (F), (I), (K), and (L)] Dashed lines represent individual cells, 
and solid lines represent means. Scale bars, 50 µm (A), 20 µm [(B), top 
panel], 2 µm [(B), bottom panel], 10 µm (G), and 5 µm (H). 

antagonist 2,3-dihydroxy-6-nitro-7-sulfamoyl-benzo-quinoxaline-2,3-dione 
(NBQX) (n = 4). Although synaptic events were observed in the 
remaining neurons, spontaneous frequency was 
insufficient to acquire enough events for sta- 
tistical analysis (figs. S7, C and D, and S9C). By 
using a high-chloride (130 mM) internal solu- 
tion and in the presence of NBQX, spontaneous 
miniature inhibitory postsynaptic currents were 
observed at a frequency of 0.24 ± 0.12 Hz (three 
of six neurons) with an amplitude of −73.3 ± 
21.0 pA, which were fully inhibited by bicuculline 
(fig. S9C). Similar to mEPSCs, inhibitory 
synaptic events were observed in the remaining 
neurons, but insufficient events were acquired 
for detailed kinetic analysis. In summary, grafted 
neurons are excitable and fire action potentials. 
In addition, they receive both excitatory and in- 
hibitory input, suggesting functional network 
connectivity. 

To determine the origin of the afferent synaptic 
input to the functionally active neurons, 
we performed monosynaptic retrograde tracing 
by using a modified rabies virus. This virus lacks 
a glycoprotein needed for replication and can 
infect only cells expressing the avian tumor 
virus receptor A (TVA) (fig. S11). Human iPSC- 
derived cortical progenitors and neurons were 
transduced with a lentiviral vector containing 
the TVA, nuclear GFP, and glycoprotein under 
the control of the human synapsin promoter 
(fig. S11A). Five months after the transplantation, 
the modified mCherry-expressing rabies 
virus was injected in the same location, where 
only grafted cells expressing the TVA are sus- 
ceptible to infection. Cells that are mono- 
synaptically connected to the infected human 
cells also become infected and express mCherry, 
allowing for accurate tracing of the neural input 
to the cells in the human grafts (fig. S11B). We 
observed that whereas most of the input to the 
transplanted human neurons comes from other 
human neurons (92.5% ± 1.5%, n = 4333 cells in 
two brains), host neurons also innervate the 
human graft (7.5% ± 1.5%, n = 397 cells in two 
brains) (fig. S11C). The traced host neurons were 
located within the graft, in the cortical areas 
adjacent to the graft, in the contralateral cortex, 
and in the ipsilateral CA1 hippocampal region 
(fig. S11B). Although no traced neurons were 
found in other subcortical regions, thalamocor- 
tical terminals were present in the graft (fig. S10; 
see also fig. S12) (20). These results provide evi- 
dence that most synaptic input to the grafts comes 
from other human neurons. Furthermore, as no 
interneurons of human origin were found, these 
data, together with the demonstration that human 
neurons in the graft receive inhibitory input (fig. 
S9C, bottom), suggest that inhibition in the hu- 
man grafts comes from the host. 

Fig. 3. In vivo calcium imaging shows that patterned population 
activity emerges early and has a defined spatiotemporal order. 

(A) Example of an imaged cortical region taken from a WT-1 graft at 1 mpt 
in the somatosensory cortex of an adult mouse. Neurons express 
tdTomato (red) and GCaMP6 (green). GCaMP-positive neurons are shown 
as a maximum-intensity projection of activity over a 4-min period of 
spontaneous activity. Active neurons (yellow) are shown by overlaying the 
images (merge). (B) Representative ΔF/F0 calcium traces (where ΔF/F0 
is the ratio of the change in fluorescence to the baseline fluorescence) 
from five active neurons imaged in a WT-1 graft at 1 mpt. (C) Distribution of 
spontaneous calcium activity in WT-1 grafts at 1 to 2 mpt. Activity was 
measured as the integral of the average ΔF/F0 signal over the entire region 
of interest (ROI), normalized to the total duration of the recording in 
seconds (n = 88 cells, six ROIs, three mice). (Inset) Percentage of ROIs in 
WT-1 grafts at 1 to 2 mpt (3 of 16 ROIs, 18.8%; n = 4 mice) and 3 mpt 
(31 of 35 ROIs, 89.0%; n = 5 mice) that exhibit bursts. Chi-square test, 
*P < 0.05. (D) Montage of image frames from a typical recurrent burst in 
a WT-1 graft. (E) Example of burst activity over two different spatial 
regions (gray and black) shown on the left, taken from the bursts in (D). 
Scale bars, 10 um (A) and 20 um (D). 

Fig. 4. In vivo modeling of structural 
and functional neuronal dynamics in 
tissue grafts from individuals with 
DS. (A) Representative example of axon 
elongation in a Ts21-1 neuron over a 
24-hour period. The inset corresponds to 
the boxed area and highlights the pres- 
ence of EPBs. The red line indicates 
alignment between the top and bottom 
images. (B) Example of axonal branch 
retraction (arrows) in a Ts21-1 neuron over 
17 hours. (C) Proportion of elongating, 
retracting, and stable neurites in 24-hour 
intervals in WT-1 (n = 96 neurites from 
79 cells, seven grafted animals, average 
11 cells per animal), Ts21-1 (n = 65 neurites 
from 60 cells, seven grafted animals, 
average 9 cells per animal), WT-2 
(n = 65 neurites from 53 cells, four grafted 
animals, average 13 cells per animal), 
and Ts21-2 (n = 60 neurites from 51 cells, 
four grafted animals, average 13 cells per 
animal) grafts at 3 wpt. WT-2 is a revertant 
disomic cell line from Ts21-2. Unpaired 
two-tailed t test; ns, not significant. 
Each data point represents an animal. 
(D) Speed of neurite elongation and 
retraction in WT-1 (n = 96 neurites from 
73 cells, average 10 cells per animal), 
Ts21-1 (n = 62 neurites from 54 cells, 
average 8 cells per animal), WT-2 
(n = 53 neurites from 47 cells, average 
12 cells per animal), and Ts21-2 
(n = 54 neurites from 46 cells, average 
12 cells per animal) grafts at 3 wpt. 
Unpaired multiple t test; ns, not significant. 
Each data point represents an animal. 
(E) Example of dendritic branches and 
spines on a Ts21-1 neuron, imaged 
at 48-hour intervals for 4 days. The boxed 
region in the left panel is magnified in 
subsequent panels. Green, red, and white 
arrowheads indicate gained, lost, and 
stable dendritic spines, respectively. 
(F) 3D rendering of the same dendritic 
region imaged in vivo in (E), obtained from 
EM reconstruction. Presynaptic terminals 
are shown in green. (G) EM images of the 
dendritic spines marked with 1 and 
2 in (E). Arrowheads indicate the location 
of synapses. Asterisk, presynaptic 
terminal. (H) Dendritic spine survival fraction over 4 days in WT-1 (n = 10 cells from 
two animals), Ts21-1 (n = 9 cells from four animals), and Ts21-2 (n = 7 cells from 
two animals) grafts at 3 to 4 mpt. Two-way ANOVA, interaction F(4,69) = 5.435, 
P = 0.0007; Tukey's multiple comparisons test, ****P < 0.0001. Each data point 
represents a cell. (I) Quantification of dendritic spine TOR over 4 days in WT-1 
(n = 10 cells from two animals), Ts21-1 (n = 9 cells from four animals), and Ts21-2 
(n = 7 cells from two animals) grafts at 3 to 4 mpt. Sidak’s multiple comparisons 
test after one-way ANOVA, F(2,23) = 3.078, **P < 0.01; ***P < 0.001. Each data 
point represents a cell. (J) Representative example of an axon on a Ts21-2 neuron 
imaged at 48-hour intervals for 4 days. The arrowheads in the insets indicate 
stable (white), new (green), and lost (red) EPBs. (K) EPB survival fraction over 
4 days in WT-1 (n = 6 cells), Ts21-1 (n = 24 cells), and Ts21-2 (n = 10 cells) grafts 
from three mice each at 3 to 4 mpt. Two-way ANOVA, interaction F = 0.8211, 
P = 0.51; ns, not significant. Each data point represents an axon. (L) EPB TOR 
over 4 days in WT-1 (n = 6 cells), Ts21-1 (n = 24 cells), and Ts21-2 (n = 10 cells) 
grafts from three mice each at 3 to 4 mpt. Sidak’s multiple comparisons test after 
one-way ANOVA, F(2,37) = 5.588, **P < 0.01; ns, not significant. Each data point 
represents an axon. (M and N) (Left) Example of imaged cortical regions taken 
from Ts21-1 (M) and Ts21-2 (N) grafts in the somatosensory cortices of adult 
mice. Neurons express tdTomato (red) and GCaMP6s (green). Active neurons 
(yellow) are shown by overlaying the images. (Right) Representative ΔF/F0 
calcium traces from five active neurons imaged in Ts21-1 (M) and Ts21-2 (N) 
grafts. Note weak synchronized burst activity across different neurons compared 
with the traces in fig. S7E. (O) Percentage of ROIs in WT-1 (50 of 52 ROIs, 
96.1%, six grafted mice), Ts21-1 (10 of 38 ROIs, 26.3%, three grafted mice), WT-2 
(34 of 34 ROIs, 100%, three grafted mice), or Ts21-2 (11 of 23 ROIs, 47.8%, 
three grafted mice) grafts that exhibit bursts at 3 to 5 mpt. Z test, ***P < 0.001. 
(P) Frequency of burst events in WT-1, Ts21-1, WT-2, and Ts21-2 grafts measured 
at 3 to 5 mpt. Kruskal-Wallis test, **P < 0.01; ***P < 0.001. (Q) Global ROI activity in 
WT-1, Ts21-1, WT-2, and Ts21-2 grafts measured at 3 to 5 mpt. Kruskal-Wallis test, 
***P < 0.001. Error bars in (P) and (Q) indicate SEM. Scale bars, 10 µm [(A) and 
(B)], 5 µm [(E), left, and (J)], 2 µm [(E), right], 0.2 µm (G), and 20 µm [(M) and (N)]. 

To assess the functional development of cortical 
networks in vivo, we engineered neurons to 
express the genetically encoded calcium indi- 
cator GCaMP6s (35) before grafting and studied 
calcium-mediated neuronal activity in vivo (Fig. 3) 
(n = 8 mice). Spontaneous, sparse activity (Fig. 3, 
A to C) was detected as early as 2 weeks post- 
transplantation (wpt) and persisted up to 3 mpt 
(Fig. 3C). In addition, bursts of activity syn- 
chronized across the neuropil and multiple 
cells (37) were also detected at 1 mpt (Fig. 3C, 
inset) and persisted in all grafts tested up to 
5 mpt (fig. S7, E to H, and movies S4 and S5). 
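
The activity measure used for these recordings (the integral of the ΔF/F0 signal normalized to the recording duration in seconds, as defined in the legend of Fig. 3) can be sketched as follows. This is a minimal illustration under stated assumptions: the baseline F0 is taken as the mean of the lowest 20% of samples, the integral uses the trapezoidal rule, and the function names and toy trace are hypothetical rather than taken from the study's methods.

```python
def delta_f_over_f0(trace, baseline_fraction=0.2):
    """Convert a raw fluorescence trace into ΔF/F0.

    Assumption: F0 is the mean of the lowest `baseline_fraction` of samples.
    """
    ordered = sorted(trace)
    n_base = max(1, int(len(ordered) * baseline_fraction))
    f0 = sum(ordered[:n_base]) / n_base
    if f0 <= 0:
        raise ValueError("baseline fluorescence must be positive")
    return [(f - f0) / f0 for f in trace]


def normalized_activity(dff, frame_interval_s):
    """Trapezoidal integral of ΔF/F0 divided by the recording duration (s)."""
    if len(dff) < 2:
        raise ValueError("need at least two samples")
    integral = sum(
        0.5 * (a + b) * frame_interval_s for a, b in zip(dff[:-1], dff[1:])
    )
    duration_s = frame_interval_s * (len(dff) - 1)
    return integral / duration_s


# Toy trace: flat baseline with a single calcium transient, sampled at 1 Hz.
raw = [100, 100, 100, 150, 200, 150, 100, 100, 100, 100]
dff = delta_f_over_f0(raw)
activity = normalized_activity(dff, frame_interval_s=1.0)
print(f"peak dF/F0 = {max(dff):.2f}, activity = {activity:.3f}")
```

Normalizing by duration makes recordings of different lengths comparable, which matters when regions of interest are imaged for unequal periods.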

Many of these bursts had a defined spatio- 
temporal order (Fig. 3, D and E), as well as re- 
current oscillatory behavior (<1 Hz between 
events) (Fig. 3D and fig. S7, E to I), with different 
incidences between 1 and 3 mpt (Fig. 3C, 
inset), resembling activity recorded in the de- 
veloping human cortex (36, 37) and consistent 
with a report on transplanted human cerebral 
organoids (38). 

Recordings of calcium signals with air-puff 
stimulation of the animal’s whiskers and facial 
skin revealed that grafted neurons in the primary 
somatosensory cortex can be responsive to sensory 
stimulation (fig. S12) (~30% of the stimulation 
trials in one mouse; neither of the other two 
animals tested showed sensory-evoked activity), 
indicating that thalamocortical synapses can func- 
tionally drive activity in the human graft at 6 mpt. 


Imaging human neuron structural and 
functional dynamics in DS 


So far, we have characterized the structural and 
functional dynamics of human cortical neurons 
during the earliest phases of their development 
in vivo (Figs. 1 to 3) and validated the main re- 
sults with neurons from an independent control 
iPSC line [designated WT-2 (for wild-type line 
2)] (fig. S7). To model the in vivo dynamics of 
pruning, synaptogenesis, and network activity 
in a complex genetic disorder, we first generated 
iPSC-derived progenitors and neurons from two 
individuals with DS (fig. S1) and then trans- 
planted the cells into adult immunodeficient 
mice. During the reprogramming process of 
one of these lines [designated Ts21-2 (Ts21 
line 2)], we identified a disomic clone that had 
lost one extra copy of human chromosome 21 
(Hsa21) (yielding a WT-2 population) (39-41). 
We used a microsatellite short tandem repeat 
(STR) assay to confirm that the parental fibro- 
blast population was not mosaic for disomy and 
trisomy of chromosome 21 and that Ts21-2 and 
WT-2 are otherwise identical to each other and 
the initial fibroblasts (fig. S13). This revertant 
disomic line (WT-2) allowed us to highlight 
phenotypes caused by an extra copy of Hsa21, 
rather than by genetic differences between in- 
dividuals, without the need for multiple con- 
trol lines, which are typically required to 
control for genetic variations or diverse differ- 
entiation potencies observed in genetically dis- 
tinct human iPSC lines (42). A genome-wide 
copy number single-nucleotide polymorphism 
assay confirmed that the two Ts21 iPSC lines 
(Ts21-1 and Ts21-2) had normal karyotypes, 
except for the extra copy of Hsa21 (fig. S14). 
Fluorescence in situ hybridization (FISH) on 
cortical tissue grafts further verified the presence 
of the extra copy of Hsa21 (fig. S15). The Ts21 
lines generated progenitors, neurons, and pro- 
liferating cells similarly to control grafts at 5 mpt 
(figs. S16 and S17). Astroglia, however, were over- 
produced in Ts21 grafts (fig. S16), recapitulat- 
ing the human pathology (43). Ts21 neurons 
were also present in stable locations to the end 
of our experimental time line, allowing for 
in vivo single-cell tracking (fig. S18). Chronic 
in vivo imaging revealed that Ts21 neurons had 
rates of axon growth and retraction similar to 
those of control neurons at 3 wpt (Fig. 4, A to D), 
suggesting normal early developmental axon 
refinement. In addition, Ts21 neurons in the 
graft formed morphologically mature synaptic 
structures, which were plastic over time (Fig. 4, 
E to L). To determine whether dendritic spine 
growth was associated with synapse formation 
in Ts21 neurons, we reconstructed in one trans- 
plant, with EM, a subset of the same dendrites 
after long-term in vivo imaging (Fig. 4, E to G). 
We found that newly formed dendritic spines 
formed synapses in 14 of 34 cases (41%) and six 
of them (6 of 14, 43%) within 48 hours of their 
first appearance. Serial EM reconstructions re- 
vealed that human dendritic spines and pre- 
synaptic terminals contained a postsynaptic 
density and synaptic vesicles, respectively, sug- 
gestive of complete synaptic maturation (Fig. 4G). 
Whole-cell recordings from coronal brain slices 
containing the Ts21 grafts showed normal syn- 
aptic input on the DS donor-derived neurons 
compared to the control (fig. S19, A to D), sug- 
gesting functional synaptic connections. Lon- 
gitudinal in vivo imaging, however, showed that 
dendritic spines, and to a lesser extent synaptic 
boutons (Fig. 4, J to L), were more stable in 
neurons from both individuals with DS than 
in the control, as demonstrated by higher sur- 
vival and reduced turnover (Fig. 4, H and I, 
and fig. S20). High density of GFP-positive neu- 
rons prevented a quantitative analysis of synap- 
tic dynamics in the WT-2 line. To understand 
whether the higher dendritic spine survival rates 
in Ts21 lines lead to higher spine density, we 
quantified dendritic spine density across the 
four lines (Ts21-1, Ts21-2, WT-1, and WT-2). We 
found an increase in dendritic spine density in 
neurons from the Ts21-1 line compared with 
WT-1 (fig. S21A), although this increase did not 
reach significance, consistent with postmortem 
fetal DS brain analysis at ~5 to 8 gestational 
months (27). However, we found higher spine 
densities in Ts21-2 than in WT-2, our most 
reliable comparison (fig. S21A). Putting the data 
from the two Ts21 and WT lines together high- 
lighted a significant spine density increase in 
the Ts21 cells (fig. S21B). Overall, these data 
raise the possibility that spine density in DS 
cortical neurons is higher than in the control, 
at least at the early developmental stages tested. 
No difference in EPB density was found across 
the four lines (fig. S21, C and D). 

To further investigate the increased synaptic 
stability phenotype, we studied neural popula- 
tion activity, a main regulator of postnatal syn- 
aptic refinement and stabilization (26), through 
in vivo calcium imaging of GCaMP6-expressing 
Ts21 grafts (Fig. 4, M and N). We measured 
both burst and global activity (see methods). 
These measures were reduced in Ts21 grafts 
(Fig. 4, O to Q). Together, these data highlight 
in vivo synaptic stability and functional early 
cortical network phenotypes in DS. 
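
To make the comparison of burst incidence concrete, the sketch below applies a pooled two-proportion z test to the ROI counts reported in the legend of Fig. 4O for WT-1 (50 of 52 ROIs) and Ts21-1 (10 of 38 ROIs). The legend names only a "Z test", so the pooled two-sided form used here is an assumption, and the function name `two_proportion_z_test` is hypothetical.

```python
import math


def two_proportion_z_test(k1, n1, k2, n2):
    """Pooled two-sided z test for the difference between two proportions.

    k1/n1 and k2/n2 are the counts of ROIs with bursts over the ROIs imaged.
    Returns the z statistic and the two-sided P value (normal approximation).
    """
    p1, p2 = k1 / n1, k2 / n2
    pooled = (k1 + k2) / (n1 + n2)
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    z = (p1 - p2) / se
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_value


# ROI counts from the legend of Fig. 4O: WT-1 versus Ts21-1.
z, p_value = two_proportion_z_test(50, 52, 10, 38)
print(f"z = {z:.2f}, P = {p_value:.2e}")
```

The normal approximation is reasonable for these sample sizes; for much smaller ROI counts, an exact test would be the safer choice.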


Conclusion 


We investigated the earliest stages of human 
axon, synaptic, and network activity development 
in a complex genetic disorder by combining live 
imaging in a multistructured tissue environ- 
ment and a patient-specific genetic background. 

Transplanted human neurons continued to de- 
velop and mature in vivo, in a microenvironment 
that retained features reminiscent of the human 
fetal cortex, such as large size (up to ~100 mm³ 
at 5 mpt) (movie S1), temporal order and dura- 
tion (i.e., many months) of neurogenesis (20), 
vascularization, and cell diversification (human- 
derived cortical progenitors, neurons, oligodendro- 
cytes, and astrocytes together with host-derived 
microglia and vessels), as well as complex cyto- 
architecture. However, the extent to which neu- 
rons in human cortical tissue grafts, generated 
from either human iPSCs (present study) or em- 
bryonic stem cells (ESCs) (20, 38, 44), can mimic 
the maturation, complexity, and functionality of 
early human fetal cortical networks remains to 
be fully established. 

Repeated imaging of single human neurons in 
cortical tissue grafts enabled us to gain insights 
on pruning, synaptic refinement, and functional 
neural network formation in vivo. We found that 
pruning occurred mainly by branch-specific re- 
traction, rather than degeneration. 

Nascent human excitatory synaptic networks 
already had balanced rates of synaptic gain and 
loss over ~1 week at the single-cell level, sug- 
gesting that immature human neurons possess 
intrinsic programs of synaptic turnover regu- 
lation over relatively short time scales. Human 
synaptogenesis and axon growth were concur- 
rent, rather than happening at different times, 
confirming previous postmortem static analysis 
(28) and revealing conservation of this develop- 
mental growth program between species (45). 

Oscillatory population activity had marked 
neuropil and soma synchronization, which became 
more prominent over 2 months, underscoring 
ongoing modifications of cortical circuits. 
Results were robust across two independent 
control lines, providing a basis for applying 
this approach, which combines live imaging 
in a multistructured tissue environment with 
a patient-specific genetic background (46), to 
many other neurodevelopmental diseases affect- 
ing the cortex. 

In this study, we modeled a complex genetic 
disorder and saw that whereas developmental 
axon refinement was normal, synapses were 
more stable and neural network activity was 
reduced in tissue grafts from two individuals 
with DS, suggesting a possible role for patterned 
activity in regulating synaptic lifetimes in the 
early stages of human cortical circuit development 
(32). These deficits were evident even after 
Ts21 cells were exposed to the in vivo physiological 
microenvironment of the mouse brain 
for several months, indicating cell-intrinsic deficits. 
By using a revertant disomic iPSC line, 
we showed that the population activity deficits 
were rescued by the loss of an extra copy of 
Hsa21, indicating that heightened expression 
of Hsa21 genes is both necessary and sufficient 
to disrupt oscillatory burst activity in developing 
cortical DS networks in vivo. 

In most previous work, human ESC- or iPSC-derived 
neurons have been transplanted into 
the damaged cortex (38, 47), spinal cord (48), 
striatum (49, 50), or retina (51), with the aim 
of cell replacement (7) rather than for disease 
modeling (6, 52), as demonstrated in our study. 
Transplantation and in vivo imaging for disease 
modeling in mice is advantageous over that in 
higher species such as primates, as larger numbers 
of animals can be used to track cells in the 
grafts over long periods of time and the model 
provides a microenvironment containing vessels, 
immune cells, and innervation, not present 
in common in vitro preparations. 

In summary, we established a new in vivo 
experimental model of DS to study how the 
chromosomal abnormality affects the earliest 
stages of human axon, synaptic, and functional 
neural network development. We expect that 
this single-cell-resolution intravital microscopy 
approach will advance the knowledge of cellular 
pathophysiology in this and other neurodevelopmental 
disorders, particularly valuable 
in light of the scarcity of early human fetal 
brain tissue material. 
TROPICAL STORMS


Dominant effect of relative tropical 
Atlantic warming on major 
hurricane occurrence 


H. Murakami, E. Levin, T. L. Delworth, R. Gudgel, P.-C. Hsu


Here we explore factors potentially linked to the enhanced major hurricane activity in the 
Atlantic Ocean during 2017. Using a suite of high-resolution model experiments, we show 
that the increase in 2017 major hurricanes was not primarily caused by La Niña conditions
in the Pacific Ocean but rather triggered mainly by pronounced warm sea surface 
conditions in the tropical North Atlantic. Further, we superimpose a similar pattern of 
North Atlantic surface warming on data for long-term increasing sea surface temperature 
(a product of increases in greenhouse gas concentrations and decreases in aerosols) to 
show that this warming trend will likely lead to even higher numbers of major hurricanes in 
the future. The key factor controlling Atlantic major hurricane activity appears to be the 
degree to which the tropical Atlantic warms relative to the rest of the global ocean. 


The 2017 hurricane season in the North
Atlantic Ocean was highly active, with six
major hurricanes (MHs) causing widespread
damage over the Gulf Coast and
the Caribbean (1, 2) (Fig. 1, A and C). Con-
sidering the mean seasonal MH number (2.7) 
and its standard deviation (1.9) during the period 
1979-2017 (Fig. 1B), the positive anomaly of MHs 
was almost two standard deviations above nor- 
mal. Specifically, among these six MHs in 2017, 
Hurricanes Harvey, Irma, and Maria made land- 
fall over the Gulf Coast and the Caribbean (Fig. 1, 
A and C), causing substantial damage in these 
coastal regions (2). Moreover, Hurricane Harvey 
ended a 12-year period of no landfalling MHs 
in the United States [the so-called “MH landfall 
drought” (3, 4)], with the most recent landfalling 
MH (Hurricane Wilma) having occurred in 2005. 
The highly active MH season in 2017 has attracted 
considerable attention throughout the scientific 
community (5), as well as broader society (6), in 
terms of its causes and also whether anthropo- 
genic forcing played a role (2). 
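
The "almost two standard deviations" statement can be checked directly from the numbers quoted above. The following is a minimal sketch of that arithmetic; the function name `standardized_anomaly` is ours and is not taken from the study.

```python
def standardized_anomaly(value, mean, std):
    """Return the anomaly of `value` in units of standard deviations."""
    if std <= 0:
        raise ValueError("std must be positive")
    return (value - mean) / std


# Values quoted in the text for the period 1979-2017.
mh_2017 = 6      # observed major hurricanes in 2017
mh_mean = 2.7    # mean seasonal MH number
mh_std = 1.9     # standard deviation of the seasonal MH number

z = standardized_anomaly(mh_2017, mh_mean, mh_std)
print(f"2017 MH anomaly: {z:.2f} standard deviations above the mean")
```

The result is roughly 1.7, consistent with the wording "almost two standard deviations above normal".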

A number of factors might have caused the 
active MH season. The boreal summer season in 
2017 was characterized by a developing moderate 
La Niña and associated conditions [see region A
in Fig. 1D; the Niño3.4 index was 0.4 standard
deviations below normal (Fig. 1B)]. It is known
that, during summers with a developing La Niña,
hurricanes are more active over the North
Atlantic due to a weakening of the vertical shear
of the zonal winds over the tropical Atlantic
relative to climatology (7, 8). The correlation co-
efficient (hereafter, r) between the Niño3.4 index
and the observed MH frequency for the period
1979-2017 is −0.45, which is statistically signifi-
cant (P value < 0.01) (Fig. 1B). It is also apparent
that the surface ocean in boreal summer 2017 
was substantially warmer than the climatological 
mean in the tropical North Atlantic, where most 
tropical cyclones are generated [the sea surface 
temperature anomaly (SSTA) was 1.5 standard 
deviations above normal in region B in Fig. 1D, 
hereafter referred to as the main developing re- 
gion (MDR); fig. S1]. On the basis of observations 
made over the past 60 years, Kossin (9) reported 
that ambient environmental vertical wind shear 
(10-12), defined as wind speed difference be- 
tween upper troposphere (200 hPa) and lower 
troposphere (850 hPa) and considered a detri- 
mental factor for tropical cyclone genesis and 
intensification, tends to be stronger off the east 
coast of the United States (region C in Fig. 1D) 
when sea surface temperature (SST) is higher 
over region B, leading to fewer MHs over region 
C. However, this relationship is not clear for the 
2017 summer. Observations for the 2017 summer 
within region C show a mixture of wind shear 
anomalies (fig. S2A): weaker in the Gulf of Mexico 
and stronger along the Canadian coast despite the 
positive SSTA over region B. Meanwhile, a sub- 
stantial positive SSTA was also observed over 
region C. It is expected that tropical cyclones 
obtain more energy from warmer oceans and then 
further develop into MHs during their westward 
propagation from the MDR. 
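
The vertical wind shear mentioned above is described only as the wind speed difference between 200 hPa and 850 hPa. As an illustration, the sketch below assumes one common convention, the magnitude of the vector wind difference between the two levels; the study may have used a different formulation, and the sample wind components are hypothetical.

```python
import math


def vertical_wind_shear(u200, v200, u850, v850):
    """Magnitude (m/s) of the vector wind difference between 200 and 850 hPa.

    Assumed convention: sqrt((u200 - u850)^2 + (v200 - v850)^2).
    """
    return math.hypot(u200 - u850, v200 - v850)


# Hypothetical wind components in m/s, for illustration only.
shear = vertical_wind_shear(u200=12.0, v200=3.0, u850=-4.0, v850=1.0)
print(f"vertical wind shear: {shear:.1f} m/s")
```

Larger values indicate an environment that is less favorable for tropical cyclone genesis and intensification, as stated in the text.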

Overall, the SSTA spatial pattern in 2017 over 
the North Atlantic resembles a positive phase 
of the Atlantic Multidecadal Oscillation (AMO)
(13). The observed AMO index is 1.5 standard 
deviations above its long-term mean for the 2017 
summer season (Fig. 1B). Indeed, the AMO index 
is moderately and positively correlated with the 
number of MHs for the period 1979-2017 (r = 
+0.50; P value < 0.01) (Fig. 1B). There is, however, 
another mode of variability over the region: the 
Atlantic Meridional Mode (AMM) (14). The AMM
has substantial variability at the interannual time 
scale in the North Atlantic, characterized by a 
meridional contrast in the SSTA across the Equa- 
tor, as well as associated surface wind anomalies. 
The correlation coefficient between the observed 
AMM index and MH frequency is +0.62 (P value < 
0.01) (Fig. 1B), and the observed AMM index for 
the 2017 summer season was 1.2 standard devi- 
ations above normal. These high positive indices 
imply a possible impact of internal natural variability
on this active MH season in 2017 (15).
However, it is also possible that greenhouse gas- 
induced global warming might have caused the 
emergence of the pronounced MH activity in the 
2017 hurricane season. This supposition is based 
on the findings of several previous modeling studies 
that commonly projected an increase in intense 
storms such as MHs under conditions of increased 
anthropogenic forcing (16-18). In this study, using
a suite of high-resolution model experiments 
(19, 20), we attempted to elucidate the physical 
reasons for the occurrence of the active 2017 MH 
season and to anticipate possible future changes 
in MH activity, given conditions similar to those 
of summer 2017 but with increased anthropo- 
genic forcing. 
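
The significance levels quoted for the correlations with the Niño3.4, AMO, and AMM indices can be sanity-checked with a standard t test for a Pearson correlation coefficient. The sketch below is not the authors' procedure, which is not specified in this section. It assumes 39 independent yearly samples (1979-2017, ignoring serial correlation) and uses an approximate tabulated two-sided 1% critical value for 37 degrees of freedom.

```python
import math


def t_statistic(r, n):
    """t statistic for testing a Pearson correlation r from n samples against zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


N_YEARS = 2017 - 1979 + 1   # 39 seasons, assumed independent
T_CRIT_1PCT = 2.715         # approximate two-sided 1% critical t for 37 d.o.f. (tabulated)

# Correlations with observed MH frequency quoted in the text.
correlations = {"Nino3.4": -0.45, "AMO": 0.50, "AMM": 0.62}

results = {}
for name, r in correlations.items():
    t = t_statistic(r, N_YEARS)
    results[name] = abs(t) > T_CRIT_1PCT
    print(f"{name}: r = {r:+.2f}, t = {t:+.2f}, significant at 1%: {results[name]}")
```

Under these assumptions all three correlations exceed the 1% threshold, which agrees with the reported P values of < 0.01.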


Successful prediction of active 
2017 MHs 


The particularly active MH season in 2017 was 
forecast well in real-time seasonal predictions 
starting from initial conditions on 1 July, using a 
high-resolution global coupled model [HiFLOR 
(19, 20)] developed at the Geophysical Fluid Dy- 
namics Laboratory. Previous studies have shown 
that HiFLOR can simulate the observed inter- 
annual variation of MH frequency in historical 
simulations (19) and offers skill in retrospective 
seasonal predictions (20) (the r between the pre- 
dicted and observed MH frequency is +0.74 for 
the period 1980-2017; see fig. S3). Figure 1E shows 
the predicted 2017 seasonal-mean MH density 
anomaly (July to November) from initial condi- 
tions on 1 July 2017 (i.e., lead month = 0 to 4), 
which can be contrasted with the corresponding 
observations shown in Fig. 1C. HiFLOR success- 
fully predicted the observed higher MH density 
along the eastern coast of the Caribbean and 
Florida, although the predicted density anomalies 
were underestimated. HiFLOR realistically pre- 
dicted the spatial pattern of the MH density 
anomaly over the Pacific Ocean as well. Because 
in these real-time seasonal predictions the oceanic 
conditions were initialized but the atmospheric 
and land surface were not, the successful predic- 
tion mainly derives from the oceanic state, along 
with the prescribed boundary conditions of anthro- 
pogenic forcing (8, 20). Indeed, HiFLOR predicted 
a higher SSTA over the Atlantic, especially over
the MDR and off the east coast of the United
States, in addition to the moderate La Niña con-
ditions as observed (Fig. 1, D and F). However,
the amplitude of the La Niña conditions was un-
derestimated in the real-time predictions. HiFLOR
also largely predicted the spatial pattern of vertical 
wind shear anomalies (fig. S2, B and C) compared 
with observations (fig. S2A); however, there are 
some differences between predictions and obser- 
vations at small regional scales. 

Observations show that the MHs in 2017 pri- 
marily occurred during a confined period in the 
late summer season (26 August to 14 October; 
black solid line in fig. S4), which is a subset of the
climatological MH season (July to October; black
dashed line in fig. S4). This clustering indicates
an important role for subseasonal oscillations for 
the timing of 2017 MHs (21, 22). HiFLOR failed to 
predict the timing of active MHs in 2017. Figure 
S4 shows that 2017 MHs (blue thick line) were 
predicted to occur earlier than the modeled cli- 
matological mean MH season (red thick line). 
However, we did not expect that HiFLOR could 
predict the timing of the active MHs for 2017 
because the atmosphere was not initialized in 
the predictions. Because a few ensemble members 
show active MHs during the late 2017 summer 
season (blue thin lines in fig. S4), it appears that
HiFLOR is able to simulate subseasonal oscil-
lations comparable to those observed (19) and
that inclusion of atmospheric initialization may
improve predictions of MH timing
during 2017.


Reasons for the active 2017 MH season 


To elucidate the relative importance of SSTAs in 
various regions for the occurrence of this highly 
active 2017 MH season, we used HiFLOR to con- 
duct a series of idealized seasonal predictions for 
the period 1 July through 30 November. In these 
idealized predictions, SST in the model was re- 
stored to the SSTs from the original HiFLOR 
seasonal predictions initialized from 1 July 2017, 
but with some modifications [we refer to the SST- 
restoring experiments as “nudging experiments” 
(19) (see supplementary method c)]. In the first 
nudging experiment, which we call CLIM, we 

Fig. 1. Observed and predicted major hurricanes (MHs) and SSTs in
2017. (A) Observed tropical cyclones during the 2017 hurricane season.
Storm tracks are colored according to the intensities of the storms, as
categorized by the Saffir-Simpson hurricane wind scale (TS, tropical storm;
C1 to C5, category 1 to category 5 hurricanes). Labeled storms denote
MHs. (B) Standardized index (σ) for the observed anomaly of MH frequency
(gray bars), along with the observed standardized index (σ) of the natural
variability (colored lines) for the Niño3.4 (blue), AMO (red), and AMM (cyan)
indices. The orange bar highlights the observed 2017 MH anomaly. (C) Observed
MH density anomaly in the 2017 hurricane season relative to the mean of
1980-2017 (number per 2.5°-by-2.5° grid box per season). (D) Observed SST
anomaly (SSTA; units: K) in the 2017 hurricane season relative to the mean
of the period 1982-2012. The black frames demarcate the possible key regions
for the unusually high MH activity in 2017. (E and F) As in (C) and (D) but for
ensemble mean of real-time seasonal predictions from 1 July 2017 initial
conditions, as predicted by HiFLOR.

Fig. 2. Prescribed idealized SSTA and predicted MH density. Idealized seasonal forecasts were
conducted by prescribing the idealized SSTs in which SSTAs (left panels; units: K) are superimposed
onto the climatological mean SST (CLIM). The resultant predicted MH density anomalies (MHDAs)
relative to the CLIM experiment are shown by the shading in the right panels (units: number per
season). The prescribed SSTAs are: (A) all 2017 anomalies (CLIM+); (B) as in CLIM+, except the
Pacific SSTAs are set to zero (PCLIM); (C) as in CLIM+, except the Pacific SSTA is replaced with the
SSTA predicted from initial conditions on 1 April 2017, predicting El Niño conditions (PEL); (D) as in
CLIM+, except the Atlantic SSTA is set to zero (ACLIM); (E) as in CLIM+, except the SSTA off the
coast of North America is set to zero (GCLIM); and (F) as in CLIM+, except the SSTA in the tropical
Atlantic is set to zero (MCLIM). Contours in the panels at right denote the mean MH density predicted from
the CLIM experiment. The contour interval is 0.6 per season. Cross symbols in the right panels
indicate that the predicted change relative to the CLIM experiment is statistically significant at the
90% confidence level or above [bootstrap method proposed by Murakami et al. (32)].


restored the SST to the predicted climatological 
mean SST computed over the period 1982-2012. 
In the second experiment, called CLIM+, we re- 
stored the SST to that predicted for 2017 (Fig. 2A). 
The CLIM and CLIM+ simulations yielded about 
three and six MHs, respectively (Fig. 3). These sim- 
ulated values were close to the observed values 
for the climatology (2.7) and the 2017 season (6). 
Moreover, the pattern of the predicted MH den- 
sity anomaly in CLIM+ (right panel of Fig. 2A) 
reflected the observed 2017 MH density anomaly 
in the North Atlantic (Fig. 1C). The fidelity of the 
MH prediction by HiFLOR gave us confidence in 
carrying out further sensitivity experiments with
the model. 

In the next sensitivity experiment, we used 
the SSTs from CLIM+, except the SSTs over the 
Pacific Ocean were replaced with those from 
CLIM. We refer to this experiment as PCLIM, 
and the results are shown in Fig. 2B. If the 2017 
La Niña conditions indeed led to the particularly
active MH season in 2017, the predicted MH fre- 
quency should have been reduced in the PCLIM 
experiment. However, the results (Figs. 2B and 3) 
show that this was not the case. Therefore, the
moderate 2017 La Niña conditions were not the


16 November 2018 


key factor for the highly active MH season in 
2017. Unlike the predictions started from initial 
conditions on 1 July 2017, the HiFLOR real-time 
seasonal predictions started from initial condi-
tions on 1 April 2017 predicted a strong El Niño
development for the 2017 summer season. This 
false alarm was also predicted by other seasonal 
models (23), possibly associated with the so-called 
“spring predictability barrier” (24). In our next 
sensitivity test, which we refer to as experiment 
PEL, we replaced the CLIM+ SST over the Pacific 
Ocean with the predicted SST from the April 
forecasts to emulate a strong El Niño condition.
Although the MH frequency in PEL decreased
slightly relative to CLIM+, the PEL prediction
still showed active MHs over the North Atlantic
(Figs. 2C and 3), indicating the possibility of high
MH activity in the Atlantic, even with El Niño con-
ditions during summer 2017. These experiments
support the assertion that the Pacific SSTA in 
2017 was not a critical factor for the particularly 
active MH season in the North Atlantic that year. 

Next, we conducted an experiment similar to 
CLIM+ but with the Atlantic SST replaced by the 
CLIM SST (ACLIM; Fig. 2D). The ACLIM predic- 
tion showed substantial reductions in both MH 
number and density (Figs. 2D and 3), suggesting 
that the local Atlantic SSTA was critical for the 
high MH activity in 2017. Furthermore, we sep- 
arately replaced the SST with the CLIM SST in 
the region off the east coast of the United States 
(GCLIM; Fig. 2E) and in the MDR (MCLIM; Fig.
2F) and found that replacing the MDR SSTA 
substantially reduced the MH frequency and 
MH density (Figs. 2F and 3). In contrast, results 
from GCLIM led us to conclude that the SSTA off 
the East Coast of the United States was not a 
major factor controlling the unusually high MH 
activity in the 2017 hurricane season, in terms of 
the number of MHs. 
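
The construction of the prescribed SST fields in these sensitivity experiments can be sketched as follows. This is only an illustration of how a target field such as MCLIM might be assembled (climatology plus the 2017 anomaly, with the anomaly zeroed inside one region). It does not reproduce the model's SST restoring scheme. The grid points and temperatures are hypothetical, and the MDR box (10° to 25°N, 80° to 20°W) is the one quoted later in the article.

```python
# MDR box as quoted in the article: 10-25N, 80-20W (longitudes in degrees east).
MDR_BOX = {"lat_min": 10.0, "lat_max": 25.0, "lon_min": -80.0, "lon_max": -20.0}


def in_box(lat, lon, box):
    """True if the grid point lies inside the latitude-longitude box."""
    return (box["lat_min"] <= lat <= box["lat_max"]
            and box["lon_min"] <= lon <= box["lon_max"])


def prescribe_sst(clim, ssta, zero_box=None):
    """Return clim + ssta at each grid point, with ssta set to zero inside zero_box.

    clim and ssta are dicts keyed by (lat, lon) tuples.
    """
    target = {}
    for (lat, lon), clim_value in clim.items():
        anomaly = ssta[(lat, lon)]
        if zero_box is not None and in_box(lat, lon, zero_box):
            anomaly = 0.0
        target[(lat, lon)] = clim_value + anomaly
    return target


# Hypothetical two-point grid: one point in the MDR, one off the U.S. east coast.
clim = {(15.0, -50.0): 300.0, (35.0, -70.0): 295.0}
ssta_2017 = {(15.0, -50.0): 0.8, (35.0, -70.0): 0.5}

clim_plus = prescribe_sst(clim, ssta_2017)                # analogue of CLIM+
mclim = prescribe_sst(clim, ssta_2017, zero_box=MDR_BOX)  # analogue of MCLIM
print(clim_plus)
print(mclim)
```

Passing a different box as `zero_box` would give analogues of the other experiments (for example PCLIM or GCLIM); the exact boundaries of those regions are not stated in this section, so they are left out here.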

Note that we utilized the predicted SSTs, rather 
than observed SSTs, for the lower boundary con- 
ditions throughout this study (supplementary 
method c). This is because we started this attri- 
bution study before the end of the 2017 hurricane 
season (i.e., real-time event attribution study), 
when observed SSTs were unavailable. Mean- 
while, we confirmed that the same conclusions 
could be obtained even by using observed SSTs 
(figs. S5 and S6). 


Effect of anthropogenic forcing on 
active MH season 


Another open question is to what extent the 
increase in anthropogenic forcing influenced the 
emergence of this particularly active MH season 
in 2017. To investigate this, we followed the ideal- 
ized seasonal-prediction framework but addition- 
ally considered potential future conditions (see 
supplementary method c). The representative 
concentration pathway 4.5 (RCP4.5) and RCP8.5
experiments, in which future changes in mean
SSTs according to CMIP5 models (fig. S7) were
superimposed onto the CLIM SST, yielded about
1.5 and 2.0 more MHs than CLIM, respectively,
with statistical significance (P values of < 0.05
and < 0.01, respectively) (Fig. 3), and a
basin-wide increase in MH density (Fig. 4, A and
B), albeit with both RCP4.5 and RCP8.5 also 
showing a marked increase in MH density in the 
northern Atlantic and Gulf of Mexico. These re- 
sults imply a general increase in MH occurrence 
induced by anthropogenic forcing. The RCP4.5 
and RCP8.5 runs showed a slight reduction in 
MH density along the location of maximum MH 
density in CLIM (i.e., maximum contour in Fig. 4, 
A and B), but the reduction was not statistically 
significant. This noisy spatial pattern of changes 
may have been caused by the small sampling size 
in the predictions (i.e., 12 ensemble members). 
To increase the sample size, we conducted another 
set of long-term (200-year) control simulations 
by fixing the level of anthropogenic forcing to 
those in 2015 (2015Cntl) and 1940 (1940Cntl) 
(see supplementary method b). These simulations 
showed a basin-wide increase in MH density in 
2015Cntl relative to 1940Cntl (Fig. 4C). 

In addition to the RCP4.5 and RCP8.5 experi- 
ments, we superimposed the 2017 SSTA (Fig. 2A) 
onto the SSTs from the RCP4.5 and RCP8.5 
experiments to mimic the impact of the 2017 
spatial distribution of SSTAs if that pattern were 
to occur in a future warmer climate. These ex- 
periments, which we refer to as RCP4.5+ and 
RCP8.5+, showed an increase in the number of 
MHs by about three relative to their future mean- 
state experiments (i.e., RCP4.5 and RCP8.5) (Fig. 
3). The increase of three MHs was similar to that 
from the present-day experiments (i.e., CLIM+ 
minus CLIM in Fig. 3), suggesting that the in- 
crease in MH activity induced by the 2017 SSTA 
was not highly sensitive to the mean climate 
state. The increase in MHs induced by the 2017 
SSTA (i.e., +3 MHs) was larger than that by the 
mean-state changes (i.e., +1.5 to 2.0 MHs; Fig. 3), 
indicating that the 2017 SSTA had a greater influence
than the mean-state change on the unusually
high MH activity that year. The MH density
anomalies projected by RCP4.5+ and RCP8.5+ 
relative to their mean-state experiments (Fig. 4, 
D and E) show a spatial pattern similar to that of 
the 2017 MH anomaly (Fig. 4F) but with a higher 
MH density over the Caribbean and near the U.S. 
coast, amplifying the risk of MHs over these re- 
gions, as well as a higher MH density over the 
open ocean in the northern North Atlantic. 
Another possible factor responsible for the 
unusually active MH season in 2017 was the 
external influence of anthropogenic aerosols. 
Dunstone et al. (25) reported that their dynamical
model revealed that anthropogenic aerosols
lowered the frequency of tropical storms during 
the 20th century, whereas sharp declines in an- 
thropogenic aerosol levels over the North Atlantic 
at the end of the 20th century increased the num- 
ber of tropical storms. On the basis of climate 
model simulations, recent studies (26, 27) also 
reported that the potential intensity, the theo- 
retical upper limit of the storm intensity given 
the large-scale environment, recently started to 
increase because of reductions in anthropogenic 
aerosols over the global domain (27) as well as 
in the North Atlantic (26), indicating potential 
increases in MH frequencies due to decreases in 
Fig. 3. Box plots for the predicted MH frequency over the North Atlantic, according to various
prescribed SSTA patterns. Red squares denote the ensemble mean, whereas the black dots 
represent each ensemble member. The boxes indicate the lower and upper quartiles, the horizontal 
lines in the middle show the median value, and the horizontal end lines show the lowest (highest)
datum still within 1.5 times the interquartile range of the lower (upper) quartile.
Fig. 4. Projected changes in MH density in the future experiments.
Projected change in the July to November mean MH density (number per
season), on the basis of (A) RCP4.5 relative to CLIM, (B) RCP8.5 relative
to CLIM, (C) 2015Cntl relative to 1940Cntl, (D) RCP4.5+ relative to RCP4.5,
(E) RCP8.5+ relative to RCP8.5, and (F) CLIM+ relative to CLIM. Cross
symbols indicate that the predicted change is statistically significant at the
90% confidence level or above [bootstrap method (32)]. Contours show
the predicted values for the reference experiment [i.e., CLIM for (A), (B), and
(F); 1940Cntl for (C); RCP4.5 for (D); and RCP8.5 for (E)]. The contour
interval is 0.6 per season.
Fig. 5. Relationship between predicted MH frequency and prescribed MDR SSTA or RSSTA.
(A) MH frequency and MDR SSTA for the present-day experiments. (B) As in (A) but for the MH 
frequency and MDR RSST (relative SST). (C and D) As in (A) and (B) but with the future experiments 
as well as the present-day experiments included. The small (large) symbols denote each ensemble 
member (the ensemble mean). The black line is the linear regression line. The correlation coefficient 
(r) and linear regression equation are displayed at the top of each panel. 


anthropogenic aerosol forcing. To investigate the 
influence of aerosols on the MH frequency, we 
conducted an additional idealized seasonal nudg-
ing prediction, with settings identical to those of CLIM+,
except that the estimated SSTA due to a reduced 
concentration of anthropogenic aerosols (fig. S8) 
was superimposed onto the CLIM+ SST (hereafter, 
AERO+; see supplementary method c). The AERO+ 
experiment did increase the MH frequency slight- 
ly (by +0.8) relative to the CLIM+ experiment 
(Fig. 3), qualitatively supporting the conclusion 
of previous studies (25-27), even for the MH fre- 
quency. However, the increase is not statistical- 
ly significant (P value = 0.19). As discussed by 
Murakami et al. (28), the projected impact of
aerosols on MH frequency may be underestimated 
in HiFLOR because the model may underestimate 
the radiative forcing by aerosols due to a lack of 
representation of the indirect effects of aerosols. 
Further refinement of the model’s physics is nec- 
essary to better estimate the impact of aerosols 
on MH frequency. 
Relative importance of relative SSTAs
on active MH season

Previous studies (29, 30) have reported the fre- 
quency of hurricanes (i.e., those weaker than 
MHs) to be positively correlated with the MDR 
SSTA (the SSTA in domain B in Fig. 1D; 10° to
25°N, 80° to 20°W), as well as the relative SSTA 
(RSSTA), which is defined as the difference be- 
tween the mean SST over the MDR and the mean 
SST over the global tropics (30°S to 30°N). This 
is also true for MHs. The correlation coefficient 
between observed MH frequency and MDR 
SSTA (RSSTA) was +0.50 (+0.61) for the period 
1979-2017. The standardized value for MH fre- 
quency for the 2017 summer season was 1.1 (0.8) 
standard deviations above its long-term mean 
(fig. S1). A recent study (15) also reported a pos- 
itive correlation between the observed MH fre- 
quency and the RSSTA on the decadal time scale. 
However, it is uncertain whether these positive 
correlations will hold true for MHs in the future. 
The response of MHs to anthropogenic warming 
might differ from that of weaker storms be-
cause, as reported in several previous studies,
the global number of weaker storms will likely 
decrease in a warmer environment in the future, 
whereas the number of intense storms will increase 
(16-18). Figure 5, A and B, shows a moderate and 
positive correlation of the predicted MH frequency 
with both the MDR SSTA and RSST, on the basis of 
a series of idealized nudging experiments in the 
present-day framework (the RCP experimental 
results are not included in these figure panels). 
The correlation (r) between the MH frequency
and MDR SSTA was +0.56 and the slope of linear
regression (hereafter, a) was +5.06 K⁻¹, similar
to the values for the RSSTA (r = +0.56 and a =
+5.80 K⁻¹). However, when we included the RCP
experiments in this analysis, the values of r and
a for the MDR SSTA dropped sharply from the 
original values (Fig. 5, A and C), whereas for 
RSSTA they remained almost the same (Fig. 5, 
B and D). To further clarify these relationships, 
we conducted an additional nudging experiment 
(P3K; see supplementary method c) with the 
CLIM SST plus a globally uniform +3 K (i.e., zero
RSSTA change) with other settings identical to 
the RCP8.5 experiment. P3K showed a slight in- 
crease in MH frequency relative to CLIM (Fig. 3), 
but not as large as that of RCP8.5+, resulting in 
a lowering of the positive correlation between 
the MDR SSTA and MH frequency. Overall, our 
results support the hypothesis that the MH fre- 
quency over the North Atlantic is highly corre- 
lated with the RSST—at least with this model. 
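
The distinction between the MDR SSTA and the relative SST, and the logic of the P3K experiment, can be illustrated with a small calculation. The sketch below follows the definition given above (mean SST over the MDR minus mean SST over the global tropics, 30°S to 30°N) and shows that a globally uniform +3 K warming raises the MDR SST by 3 K while leaving the relative SST unchanged. The cosine-of-latitude area weighting, the ocean-only toy grid, and all temperature values are assumptions made for illustration.

```python
import math

# Boxes as (lat_min, lat_max, lon_min, lon_max), longitudes in degrees east.
MDR = (10.0, 25.0, -80.0, -20.0)         # main developing region, from the article
TROPICS = (-30.0, 30.0, -180.0, 180.0)   # global tropics, 30S to 30N


def area_mean(field, box):
    """Cosine-latitude-weighted mean of field values inside the box."""
    lat_min, lat_max, lon_min, lon_max = box
    total = 0.0
    weight_sum = 0.0
    for (lat, lon), value in field.items():
        if lat_min <= lat <= lat_max and lon_min <= lon <= lon_max:
            weight = math.cos(math.radians(lat))
            total += weight * value
            weight_sum += weight
    if weight_sum == 0.0:
        raise ValueError("no grid points inside box")
    return total / weight_sum


def relative_sst(field):
    """Mean SST over the MDR minus mean SST over the global tropics."""
    return area_mean(field, MDR) - area_mean(field, TROPICS)


# Hypothetical ocean grid points (lat, lon) with SST in kelvin.
clim = {
    (15.0, -50.0): 301.0,
    (20.0, -30.0): 300.0,
    (0.0, 160.0): 302.0,
    (-20.0, 90.0): 299.0,
}
p3k = {point: sst + 3.0 for point, sst in clim.items()}  # uniform +3 K warming

mdr_change = area_mean(p3k, MDR) - area_mean(clim, MDR)
rsst_change = relative_sst(p3k) - relative_sst(clim)
print(f"MDR SST change: {mdr_change:+.2f} K, relative SST change: {rsst_change:+.2f} K")
```

This is why an experiment such as P3K separates the two predictors: it moves the MDR SSTA by a large amount without moving the relative SST.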


Discussions 


Overall, the results from our series of idealized 
experiments show that the enhanced MH activ- 
ity in the Atlantic in 2017 was mainly caused by 
the larger SSTA over the MDR relative to the rest 
of the global ocean, rather than by the moderate 
La Niña conditions. However, this does not mean
that the MH frequency and spatial distribution
in the Atlantic are always insensitive to the El
Niño-Southern Oscillation (ENSO). As docu-
mented in previous studies (19, 31), HiFLOR can
capture the observed contrast in MH activity in 
the North Atlantic in different ENSO phases. We 
speculate that in 2017 the effect of the MDR SSTA 
was large enough to increase the MH activity re- 
gardless of the remote effects of any phase of 
ENSO. The higher tropical North Atlantic SSTA 
was possibly associated with a specific phase of 
natural variability, like the positive phase of the 
AMO or AMM. However, it is also possible that to 
some degree the 2017 SSTA contained the effects 
of anthropogenic forcing too, although we cannot 
separate the two factors at this moment. It also 
remains uncertain whether we will see more of 
these active hurricane seasons, like that of 2017, 
in the ensuing decades, despite the arguably strong 
indicator of having experienced two successive 
active MH seasons since 2016 (Fig. 1B). This is 
because a recent study (15) suggested a marked 
reduction in MHs since 2005 on the decadal time 
scale in association with a weakening of the 
Atlantic meridional overturning circulation (AMOC). 
Monitoring AMOC and RSST anomalies is key 
to predicting the MH activity of the future, al-
though, according to the moderate correlation in 
our study (r = +0.61) between the RSST anomaly 
and MH frequency in observations (fig. S1), the 
RSST anomaly is not the only factor determin- 
ing the MH frequency in the North Atlantic. 
Further research is necessary to address the 
physical mechanisms underpinning highly active
MH seasons.
Heterobiaryl synthesis by contractive
C-C coupling via P(V) intermediates 


Michael C. Hilton, Xuan Zhang, Benjamin T. Boyle, Juan V. Alegre-Requena,
Robert S. Paton, Andrew McNally


Heterobiaryls composed of pyridine and diazine rings are key components of pharmaceuticals 
and are often central to pharmacological function. We present an alternative approach
to metal-catalyzed cross-coupling to make heterobiaryls using contractive phosphorus
C-C couplings, also termed phosphorus ligand coupling reactions. The process starts by
regioselective phosphorus substitution of the C-H bonds para to nitrogen in two successive
heterocycles; ligand coupling is then triggered via acidic alcohol solutions to form the
heterocycles; ligand coupling is then triggered via acidic alcohol solutions to form the 
heterobiaryl bond. Mechanistic studies imply that ligand coupling is an asynchronous process 
involving migration of one heterocycle to the ipso position of the other around a central 
pentacoordinate P(V) atom. The strategy can be applied to complex drug-like molecules 
containing multiple reactive sites and polar functional groups, and also enables convergent 
coupling of drug fragments and late-stage heteroarylation of pharmaceuticals. 


Reactions that couple two aromatic rings to
make biaryls are among the most widely
used processes in the pharmaceutical industry
(1, 2). Coupling of pyridines and
diazines results in heterobiaryls, a privileged
pharmacophore found in commercial drugs
as well as numerous therapeutic candidates, such 
as the examples shown in Fig. 1A (3-5). These 
heterocycles often play a key role in drug-receptor 
binding and impart other important properties 
such as net polarity, aqueous solubility, and re- 
sistance to oxidative metabolism. Most con- 
ceivable aryl-aryl couplings can be accomplished 
using metal-catalyzed cross-coupling reactions; 
these processes feature exceptional chemoselec- 
tivity, precise regioselectivity, and sufficient ro- 
bustness to be applied to both drug discovery 
and manufacture (6-8). However, the same syn- 
thetic prowess is not transferable to heteroaryl- 
heteroaryl coupling, particularly for complex 
substrates. An alternative strategy that addresses 
the shortcomings in this fundamental bond con- 
struction would therefore offer new opportunities 
to incorporate heterobiaryls into therapeutic 
candidates. 

For de novo synthesis of heterobiaryls, a sche- 
matic for metal-catalyzed cross-couplings is shown 
in Fig. 1B (9-15). A minimum of three steps are 
required, and there are challenges in the coupling 
step, such as catalyst poisoning and decompo- 
sition of starting materials (16). Furthermore, 
drug-like molecules and intermediates often 
have multiple reactive sites and a high proportion 
of polar functional groups, such as basic amines, 
that interfere with catalytic processes and cause 
a considerable number of them to fail (15, 17). 
Another serious problem arises from the lack of
methods to prepare the cross-coupling precur- 
sors. Although simple heteroaryl halides are com- 
mercially available or can be straightforwardly 
prepared, direct and selective halogenation of 
pyridine and diazine derivatives encountered 
during drug development remains an unsolved 
challenge (18, 19). Similarly, synthesizing nu-
cleophilic coupling partners such as heteroaryl
boronic acids, stannanes, and organozinc or 
magnesium compounds is challenging, and 
they are often prepared from the correspond- 
ing heteroaryl halides to begin with (20). Cross- 
dehydrogenative couplings of heteroarenes have 
shown some promise but are currently limited 
to specific pyridine combinations and are not 
applicable in complex settings (2). 


Reaction development 


The limitations of current heterocycle coupling 
methods can potentially be overcome by con- 
tractive phosphorus C-C couplings, often termed 
phosphorus ligand coupling reactions; a test 
system is shown in Fig. 1C (22-24). The strategy 
does not rely on heteroaryl halides or partners 
such as boronic acids, but instead regioselectively 
substitutes the C-H bond in each heterocyclic 
coupling partner by successive C-P bond forma- 
tions to produce a bis-azaarene phosphonium 
salt; phosphorus ligand coupling is then trig- 
gered to form the heterobiaryl bond via a P(V) 
intermediate. Heteroaryl-heteroaryl coupling has 
previously been observed at phosphorus centers, 
but an inability to transform a generic set of pyr- 
idines and diazine precursors into the required 
bis-azaarene phosphonium salts has restricted 
these processes to specialized cases (25-29). In 
our test system, stage A combined the first heterocycle, 2-phenylpyridine, with Tf2O at low
temperature to form an intermediate pyridinium triflyl salt (not shown); adding fragmentable
phosphine 1 (prepared on large scale from diphenylphosphine and methyl acrylate) (30)
results in a para-selective reaction to form dearomatized intermediate Int-I (31-37). Two
equivalents of DBU (1,8-diazabicyclo[5.4.0]undec-7-ene) eliminate first the triflyl anion to
form phosphonium ion Int-II, and then methyl acrylate to form heteroaryl phosphine 2a in
good yield. Pyridine was chosen as the second coupling partner in stage B with phosphine 2a
as a nucleophile, resulting in bis-heteroaryl phosphonium salt 3a, with complete regiocontrol.
Several nucleophiles are known to initiate phosphorus ligand coupling, including alkoxides,
Grignard reagents, and acidic alcohol solutions (22, 25-29); for stage C, we found acidic
alcohol solutions to be most effective and two equivalents of HCl in EtOH at 80°C to be
optimal, forming heterobiaryl 4a in excellent yield with diphenylphosphine oxide as a
by-product (see table S1). We did not observe products from heteroaryl-phenyl or
phenyl-phenyl coupling, nor ethoxylation of either heterocycle, in this protocol.


Mechanistic investigation 


To investigate the reasons for selective heterocycle-heterocycle coupling and the kinetics
of the ligand-coupling process, we performed a series of experimental and computational
studies. We hypothesized that ethanol attacks the phosphorus center and a P(V) species is
formed. Subjecting salt 3a to a solution of DCl in d4-methanol results in successive shifts
of pyridine proton resonances per equivalent of acid by 1H NMR (nuclear magnetic resonance)
and 31P NMR spectroscopy, and indicates that both pyridines are protonated (see figs. S17
and S18). However, no P(V) intermediates were detected in a 31P NMR study under the
reaction conditions. Computational studies do predict that intramolecular ligand coupling
occurs from P(V) intermediate Int-III in a stepwise fashion (see below) and that there is a
substantial barrier-lowering effect (ΔG‡) upon successive protonation of Int-III (Fig. 2A)
(38). Transition state energies considerably favor pyridine-pyridine coupling over
pyridine-phenyl coupling for each protonation state; ΔG_rxn values show that the process is
similarly exergonic and irreversible in each case, reinforcing the conclusion that selective
pyridine-pyridine coupling results from kinetic differences in the ligand-coupling
transition state rather than thermodynamic factors. The intrinsic reaction coordinate (IRC;
Fig. 2B) shows no involvement of alkoxy lone pairs and negligible changes to the other three
equatorial P-C bonds. In the C-C bond-forming transition structure [TS-I-2H]²⁺, a single P-C
bond breaks, allowing one ligand to migrate to the ipso-carbon of another (Fig. 2C). The
intermediate formed in this key step ([Int-IV-2H]²⁺) is a dearomatized adduct characteristic
of nucleophilic aromatic substitution, which is predicted to collapse irreversibly (ΔG =
-39 kcal/mol) and with considerable ease (see figs. S9 to S11). This stepwise ligand
coupling is therefore mechanistically distinct from the concerted cleavage of two σ-bonds
during reductive elimination at, for example, Pd(II) or in dihydrogen formation from PH5.
This latter detail is important because concerted coupling of apical-equatorial substituents
from a (D3h) trigonal bipyramidal compound is symmetry-forbidden (fig. S6) (24); in
contrast, this stepwise-coupling mechanism permits, and indeed favors, the migration of an
apical ligand to an equatorial one ([TS-I-2H]²⁺) (fig. S7).

The computed structures of P(V) intermediates, such as [Int-III-2H]²⁺, are characterized
by stronger, shorter (d_P-C(py) = 1.86 Å) bonds to equatorial ligands and weaker, longer
(d_P-C(py) = 1.99 Å) bonds to those in apical positions. This is a result of three-center,
four-electron bonding between the apical ligands and the central phosphorus atom.
Accordingly, the relative stability of P(V) stereoisomeric forms can be readily predicted
on the basis of each ligand’s capacity to stabilize the buildup of electron density at the
apical positions: σ-electron-withdrawing alkoxy and heteroaryl groups preferentially occupy
the apical sites (fig. S5). Weaker and more polar apical P-L bonds favor migration in
nucleophilic 1,2-rearrangements, in which an equatorial ligand acts as the electrophilic
acceptor (fig. S7), leading to ligand coupling. Phenyl ligands are unfavorable for both
donor and acceptor roles in ligand coupling: Apical positions (donors) favor more
σ-electron-withdrawing substituents, whereas pyridyl substituents are superior acceptors.
This explains the complete absence of biphenyl and phenyl-heterobiaryl coupled products.
N-protonation decreases the activation barrier considerably, from 30 to 20 kcal/mol, by
increasing the electrophilicity of the equatorial pyridyl group. Successive N-protonation
further reduces the activation barrier to 14 kcal/mol by increasing the
σ-electron-withdrawing power of the axial donor ligand and weakening the P-C bond
(d_P-C(py) increases from 1.95 Å in [Int-III-H]⁺ to 1.99 Å in [Int-III-2H]²⁺), whereas
equatorial P-C bonds are largely unchanged (d_P-C(py) is 1.87 Å in [Int-III-H]⁺ and 1.86 Å
in [Int-III-2H]²⁺). C-O coupling from [Int-III-2H]²⁺ is also computed to be disfavored
relative to pyridine-pyridine coupling (ΔG‡ = 18 kcal/mol for C-O coupling versus ΔG‡ =
14 kcal/mol for pyridine-pyridine coupling; fig. S8) (3D).
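
To put the computed barriers in perspective, the sketch below converts the three activation free energies quoted above (30, 20, and 14 kcal/mol) into first-order rate constants with the Eyring equation, k = (k_B T/h) exp(-ΔG‡/RT). It is only an illustration: it assumes a transmission coefficient of 1 and a temperature of 80°C (the stage C temperature), and the function names are hypothetical rather than taken from the study. The numbers describe the ligand-coupling step alone, not the observed rate of heterobiaryl formation.

```python
import math

# Physical constants (CODATA exact values for k_B and h).
K_B = 1.380649e-23       # Boltzmann constant, J/K
H = 6.62607015e-34       # Planck constant, J*s
R_KCAL = 1.987204259e-3  # gas constant, kcal/(mol*K)


def eyring_rate(dg_kcal_per_mol: float, temperature_k: float) -> float:
    """First-order rate constant (1/s) from the Eyring equation.

    Assumes a transmission coefficient of 1 (an assumption of this sketch).
    """
    prefactor = K_B * temperature_k / H
    return prefactor * math.exp(-dg_kcal_per_mol / (R_KCAL * temperature_k))


def half_life(rate_constant: float) -> float:
    """Half-life (s) of a first-order process."""
    return math.log(2) / rate_constant


if __name__ == "__main__":
    temperature = 353.15  # 80 degrees C, assumed to match stage C
    barriers = [
        ("unprotonated", 30.0),
        ("mono-protonated", 20.0),
        ("di-protonated", 14.0),
    ]
    for label, dg in barriers:
        k = eyring_rate(dg, temperature)
        print(f"{label:16s} dG = {dg:4.1f} kcal/mol  k = {k:.2e} 1/s  "
              f"t1/2 = {half_life(k):.2e} s")
```

Under these assumptions the estimated rate constants are roughly 2 × 10⁻⁶, 3, and 1.6 × 10⁴ s⁻¹, so each protonation accelerates the coupling step by several orders of magnitude.
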

Figure 2D examines the effect of phosphorus electrophilicity on the rate of heterobiaryl
formation. The low energy barrier for ligand coupling in [Int-III-2H]²⁺ implies that the
rate-determining step precedes this event and involves attack of the alcoholic solvent at
the phosphonium center. We prepared a set of salts with substituted aryl groups that would
change the electrophilicity at phosphorus; rate data show faster heterobiaryl formation as
the electrophilicity of the phosphonium increases, in line with the above hypothesis.
Further experimental verification of the low barriers for ligand coupling is shown in
Fig. 2E. Acidic alcohol solutions are inefficient for heterobiaryl formation at lower
temperatures (table S1); however, when ethoxide is used as a nucleophile for facile addition
to the phosphonium ion (fig. S24), heterobiaryl synthesis occurs in minutes at room
temperature, with trace amounts of C-O coupling also observed. Substantial amounts of
products resulting from protiodephosphination are formed under these conditions, making
this protocol less practical than that under acidic conditions.

Fig. 1. Important heterobiaryl-containing drugs and synthetic strategies. (A) Heterobiaryls
in drugs. (B) Heterobiaryls via metal-catalyzed cross-coupling reactions. R denotes a
general organic group; Hal, halogen substituent. (C) Test system for heterobiaryl synthesis
via phosphorus ligand coupling reactions. Ph, phenyl; Me, methyl; Et, ethyl; Tf,
trifluoromethylsulfonyl; DBU, 1,8-diazabicyclo[5.4.0]undec-7-ene; rt, room temperature.
Conditions shown in (C): stage A, Tf2O; 1; DBU, CH2Cl2, -78 °C to rt, 3 hours; stage B,
Tf2O; 2a; DBU, CH2Cl2, -78 °C to rt, 1.5 hours; stage C, HCl (2 equiv), EtOH, 80 °C,
14 hours.


Substrate scope exploration 


We next selected a set of pyridines and diazines to examine which substitution patterns
and functional groups could be tolerated in the ligand coupling process (Fig. 3). The
reaction is completely selective for the 4-position of pyridines
in the vast majority of cases studied, unless a 
4-substituent is present, which switches selectiv- 
ity to the 2-position. A variety of 4,4'-bipyridines 
are accessible using this strategy (4b-4f); func- 
tional groups such as esters, trifluoromethyl 
groups, and methoxy groups are accommodated, 
as are halides that would normally be active in 
metal-catalyzed reactions. Substituents can be 
present at the 2- or 3-positions of pyridines, and 
example 4e shows that a 2-position substituent 
is not a requirement (see below). A fluorinated 
2,4’-quinoline-pyridine was also synthesized by 
phosphorus ligand coupling (4g) (39). Examples 
of 2,2’-systems, 4h and 4i, showcase an alter- 
native to Suzuki couplings, where 2-pyridyl and 
quinolyl boronic acids often decompose during 
metal-catalyzed reactions (16). Pyrimidine- and 
pyrazine-containing heterobiaryls 4j and 4k were formed via the three-step sequence, with
lower yields in the coupling step relative to pyridine examples.
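
The site-selectivity rule described above (4-position by default, switching to the 2-position when a 4-substituent is present) can be written as a small lookup. This is a minimal sketch, not a predictive model: the function name is hypothetical, ring positions are numbered with nitrogen as position 1, and treating the 6-position as the fallback when both the 4- and 2-positions are occupied is an assumption of this example. The text notes that the rule holds "in the vast majority of cases," so exceptions are expected.

```python
from typing import Iterable, Optional


def predicted_coupling_site(substituent_positions: Iterable[int]) -> Optional[int]:
    """Return the pyridine ring position expected to bear the new C-P bond.

    Positions are numbered with the ring nitrogen as 1. The rule encoded here
    is the one stated in the text: the 4-position reacts unless it is already
    substituted, in which case an ortho (2- or 6-) position reacts instead.
    Returns None when positions 2, 4, and 6 are all occupied.
    """
    occupied = set(substituent_positions)
    if 4 not in occupied:
        return 4
    # Assumption of this sketch: the 2- and 6-positions are interchangeable
    # ortho sites, tried in that order.
    for ortho in (2, 6):
        if ortho not in occupied:
            return ortho
    return None


if __name__ == "__main__":
    print(predicted_coupling_site([2]))   # 2-substituted pyridine
    print(predicted_coupling_site([4]))   # 4-substituted pyridine
```
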


Fig. 2. Computational and experimental investigation of phosphorus ligand coupling.
(A) Biaryl-coupling activation barriers [coupled cluster/density functional theory
SMD-DLPNO-CCSD(T)/cc-pV(DT)Z//ωB97XD/6-31+G(d), kcal/mol] decrease upon protonation and
consistently favor heterobiaryl formation. Py, pyridine. (B) Computed bond orders show a
single (apical) P-C(py) bond breaking along the reaction coordinate, with little
involvement of oxygen lone pairs. LP_O, number of electrons in each oxygen lone pair.
(C) Optimized structures for [Int-III-2H]²⁺, [TS-I-2H]²⁺, and [Int-IV-2H]²⁺ show stepwise
apical-equatorial ligand coupling. (D) A kinetic study indicates that alcohol addition is
rate-limiting. TfOH was used in place of HCl because of poor solubility of aryl
derivatives. Yields after complete consumption of the phosphonium salts were approximately
the same in each case (89 to 94%). (E) Room-temperature coupling using ethoxide as a
nucleophile.

Fig. 3. Azaarene scope and guidelines for phosphonium salt formation. Yields of isolated
products after each stage are shown. n-Bu, normal butyl group; n-Pr, normal propyl group.
Reaction guidelines are shown for phosphonium salt formation involving ortho and non-ortho
substituted pyridines as partners. Further details of challenges and limitations are
highlighted in fig. S25.


Reaction guidelines


During these studies, we have established a general set of reaction guidelines and
limitations. First, when coupling 2-substituted pyridines to 3-substituted pyridines, it is
important to perform the salt-forming sequence in the correct order (Fig. 3). Taking
heterobiaryl 4b as a representative example, if heteroaryl phosphine 2b’ is used instead in
stage B, then salt 3b is not formed. We believe that a biased Tf-salt equilibrium rapidly
develops, and pyridinium-phosphine [2b’-Tf]⁺ is favored on steric grounds; the
2-substituted pyridine is then not activated for nucleophilic addition, and the desired
salt is not formed (fig. S25). Instead, the 2-substituted pyridine should be converted into
the corresponding phosphine and used as a nucleophile with the 3-substituted pyridine in
stage B. Second, problematic substrates for heteroaryl phosphine and salt formation include
pyridines with 2-trifluoromethyl groups, 4-alkyl or aryl substituents, and
2,6-disubstituted pyridines. In general, pyridines and diazines with more than two
electron-withdrawing groups or electron-donating groups can result in low yields or no
phosphonium salt formation. During ligand coupling, we have observed that pyridines
substituted with bromides and iodides can be dehalogenated, that 2-chloro- or
2-fluoropyridines are not successful, and that 2-methoxypyridines proceed with slower
rates. For pyridines containing electron-withdrawing groups, using EtOH and HCl can result
in ethoxylation. Changing the acid to TfOH avoids this problem and leads us to believe that
ethoxylation results from chlorination followed by ethoxylation via nucleophilic aromatic
substitution. Trifluoroethanol is preferred when molecules contain functional groups such
as amides and esters that are susceptible to ethanolysis. In general, one equivalent of
acid per basic nitrogen is optimal (see below).
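
The guidelines above lend themselves to a simple decision helper. The following sketch encodes only the rules stated in this section (acid choice, solvent choice, acid stoichiometry, and the order of salt formation for 2- plus 3-substituted pyridines). All class, function, and field names are hypothetical, and the helper is a reading aid rather than a validated planning tool; the complex substrates discussed later, for instance, use three or four equivalents of acid.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CouplingSubstrate:
    """Hypothetical description of a bis-azaarene phosphonium salt."""
    basic_nitrogens: int                          # basic N atoms the acid will protonate
    has_electron_withdrawing_group: bool = False  # on either pyridine
    has_ester_or_amide: bool = False              # groups susceptible to ethanolysis


def suggest_stage_c_conditions(substrate: CouplingSubstrate) -> dict:
    """Suggest acid, solvent, and acid equivalents for the ligand-coupling step."""
    # TfOH avoids the ethoxylation seen with HCl on electron-poor pyridines.
    acid = "TfOH" if substrate.has_electron_withdrawing_group else "HCl"
    # Trifluoroethanol replaces EtOH when ethanolysis is a risk.
    solvent = "CF3CH2OH" if substrate.has_ester_or_amide else "EtOH"
    # General guideline: one equivalent of acid per basic nitrogen.
    return {"acid": acid, "solvent": solvent,
            "acid_equiv": substrate.basic_nitrogens}


def phosphine_partner(position_a: int, position_b: int) -> Optional[str]:
    """Return 'a' or 'b': which partner to convert into the phosphine in stage A.

    Only the 2-substituted plus 3-substituted case is covered by the text:
    the 2-substituted pyridine becomes the phosphine. Other pairs return None.
    """
    if (position_a, position_b) == (2, 3):
        return "a"
    if (position_a, position_b) == (3, 2):
        return "b"
    return None


if __name__ == "__main__":
    # Test system: 2-phenylpyridine coupled with pyridine (two basic nitrogens).
    print(suggest_stage_c_conditions(CouplingSubstrate(basic_nitrogens=2)))
    print(phosphine_partner(3, 2))
```

For the test system this returns HCl in EtOH with two equivalents of acid, matching the optimized stage C conditions reported earlier.
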


Application to complex intermediates 


Our attention then turned to ligand couplings 
involving complex azaarenes (Fig. 4). Convergent 
couplings of pyridine-containing fragments were
first examined; these molecules are representative 
of drug leads, which are promising candidates for 
a therapeutic target but have suboptimal phar- 
macokinetic and pharmacodynamic properties 
(40). A convergent coupling strategy would en- 
able rapid access to complex heterobiaryls from 
compounds common in pharmaceutical libraries 
(41, 42). Four examples in Fig. 4 are shown where 
the corresponding halide precursors are not com- 
mercially available or would be challenging to 
prepare (4l-4o). Heterobiaryl bonds are formed
with precise regioselectivity, and the presence of 
additional saturated and unsaturated nitrogen 
heterocycles is tolerated in this approach. Three 
or four equivalents of acid are used in the cou- 
pling step in these cases to ensure adequate reac- 
tion rates. 

Next, we investigated whether the ligand- 
coupling strategy could be applied to advanced 
intermediates in drug development. Success in 
this endeavor would offer distinct strategies to 
introduce heterobiaryls into complex molecules 
and alleviate concerns over metal contamina- 
tion in subsequent biological testing. To demon- 
strate the feasibility of this approach, we chose 
a set of existing drug molecules with diverse 
structures, substitution patterns, and functional 
groups (43). 


The use of previously synthesized heteroaryl 
phosphines (Fig. 4) shows that heteroarylation is 
possible in these complex systems with complete 
control of regioselectivity and site selectivity. 
Chlorphenamine, a common antihistamine, and 
loratadine, an allergy medicine, are competent 
substrates for this protocol, with the resulting 
heterobiaryls isolated in good overall yields (4p 
and 4q) that again highlight how halides can 
be tolerated during the coupling procedure. 
Vismodegib was converted into a 2,4’-quinoline-pyridine
system in moderate yield (4r). A widely
applied fungicide, quinoxyfen, was also compa- 
tible with the reaction protocol (4s). Etoricoxib 
and imatinib are challenging examples because 
they contain multiple reactive sites (34). The 
structural features in etoricoxib enable selective 
transformation of the 2,5-disubstituted pyridine 
(4t), and heteroarylation of the pyridine occurs 
selectively over the pyrimidine in imatinib to 
form 4u. 


Outlook 


This phosphorus ligand coupling method overcomes major limitations of metal-catalyzed
approaches by virtue of its compatibility with polar functionalities found in drug-like
molecules and its circumvention of preformed heteroaryl halides and boronic acids. As well
as transforming building block compounds, convergent coupling of drug fragments and
heteroarylation of complex pharmaceuticals were demonstrated. The protocol uses readily
available reagents under simple conditions and is immediately applicable in medicinal
chemistry.

Fig. 4. Heterobiaryl synthesis in complex molecules. Yields of isolated products after each
stage are shown. Further examples of advanced stage couplings are shown in fig. S26.


Templated nanofiber synthesis via
chemical vapor polymerization into 
liquid crystalline films 


Kenneth C. K. Cheng, Marco A. Bedolla-Pantoja, Young-Ki Kim, Jason V. Gregory, Fan Xie,
Alexander de France, Christoph Hussal, Kai Sun, Nicholas L. Abbott, Joerg Lahann


Extrusion, electrospinning, and microdrawing are widely used to create fibrous polymer 
mats, but these approaches offer limited access to oriented arrays of nanometer-scale 
fibers with controlled size, shape, and lateral organization. We show that chemical vapor 
polymerization can be performed on surfaces coated with thin films of liquid crystals
to synthesize organized assemblies of end-attached polymer nanofibers. The process uses
low concentrations of radical monomers formed initially in the vapor phase and then 
diffused into the liquid-crystal template. This minimizes monomer-induced changes to the 
liquid-crystal phase and enables access to nanofiber arrays with complex yet precisely 
defined structures and compositions. The nanofiber arrays permit tailoring of a wide range 
of functional properties, including adhesion that depends on nanofiber chirality. 


Surfaces decorated with oriented arrays of fibers are ubiquitous in the natural world
because they can provide functions such as sensing [hair cells (1)], thermal insulation
[polar bear fur (2)], enhanced mass transport [microtubules (3)], extreme wetting
properties [lotus leaf (4)], and reversible adhesion [gecko foot (5)]. However,
re-creation of these functions in synthetic materials requires multiscale engineering of
the composition, shape, and morphology of individual fibers, as well as control of
higher-order organization of fibers into arrays. We address this challenge by building from
studies reported in 1916 by T. Svedberg (6), who used the long-range molecular order and
fluidity inherent to liquid crystals (LCs) to control chemical reactions, principles that
have since been exploited in a wide range of transformations based on unimolecular
reactions (7), molecular self-assembly (8), or polymerizations (9, 10). A key limitation of
LC-templated polymerization, however, has been perturbation of the LC phase by monomers
that are dissolved into the LC before the polymerization and then consumed during
polymerization (9, 10).

Chemical vapor polymerization (CVP) is a versatile process that is compatible with a range
of polymerization modes (Fig. 1A), including reaction pathways using [2.2]paracyclophanes
(11-13) (Gorham process) or halogenated xylene precursors (Gilch process) (14, 15) and
free-radical ring-opening copolymerization (ROP) using 5,6-benzo-2-methylene-1,3-dioxepane
and [2.2]paracyclophanes (16). CVP is widely used for fabrication of consumer products
(e.g., electronics and packaging) because it enables rapid and inexpensive conformal
coating of large surface areas with polymer films (12). Whereas past studies used CVP to
form continuous polymeric films on surfaces (11-16), we found that CVP of compound 1a into
micrometer-thick supported films of nematic LCs (Fig. 1, B and C) resulted in the formation
of surfaces decorated with aligned arrays of nanofibers (Fig. 1, D and E). When nematic
4’-pentyl-4-biphenylcarbonitrile (5CB) was anchored on a silane-functionalized glass
surface in a perpendicular orientation (i.e., homeotropic anchoring, Fig. 1D), the
nanofibers were straight and aligned perpendicular to the surface. Removal of the LC
[confirmed by Fourier transform infrared (FTIR) spectroscopy and solid-state ¹³C nuclear
magnetic resonance (NMR) spectroscopy, figs. S1 and S2] revealed that the nanofibers were
made of insoluble polymer, were anchored at one end to the surface, and were structurally
amorphous (fig. S3) yet optically birefringent, as confirmed by cross-polarized light
microscopy (Fig. 1F). Insertion of a quarter-wave plate between crossed polarizers (Fig. 1,
G and H) revealed that the refractive index was greatest along the fiber axis, consistent
with electron diffraction patterns (fig. S3B), indicating alignment of polymer chains along
the main axis of the nanofibers (Fig. 1I) (17).

Fig. 1. Templated synthesis of nanofiber arrays via CVP into anisotropic media. (A) CVP of
1a to 1h yields polymers 2a to 2h. m, n, and l: copolymer repeat units; Δ: 250°C. (B and
C) Representative chemical structures of cyanobiphenyl-based (5CB and E7) (B) and
halogenated (TL205) (C) LCs. (D) Fabrication of polymer nanofibers via CVP into a LC phase
aligned perpendicular to the substrate. (i) CVP; (ii) LC removal. (E) Scanning electron
microscopy (SEM) images of nanofibers polymerized from 1a (10 mg) in 5CB. After the
nanofiber synthesis, the LC template was removed. (F) Optical micrograph (crossed polars)
of a nanofiber. Orientations of the analyzer (A) and polarizer (P) are shown in the white
double-arrow cross. The yellow double arrow indicates the main axis of the nanofiber.
(G and H) Micrographs (crossed polars) of the nanofiber with a quarter-wave plate with its
slow axis (γ, green double arrow) perpendicular (G) or parallel (H) to the fiber axis;
lower-order interference colors [yellow in (G)] indicate a decrease in retardance.
(I) Analysis of interference colors of the nanofiber in (G) and (H) indicates that the
polymer chains are aligned along the fiber axis.

CVP into films that lacked fluidity (crystalline solid 5CB, Fig. 2A) or long-range order
(isotropic liquid 5CB, Fig. 2C; or silicone oil, Fig. 2D) did not yield nanofibers,
indicating that long-range order and fluidity are both necessary requirements for the
shape-controlled synthesis of nanofiber arrays (Fig. 2B). Replacement of 5CB with nematic
E7 (a mixture of cyanobiphenyls, Fig. 1B), TL205 (a mixture of halogenated molecules,
Fig. 1C), or other nematic LCs (fig. S4) yielded organized assemblies of nanofibers with
well-defined yet distinct diameters (D) of 142 ± 11 nm in 5CB, 85 ± 9 nm in E7, and
69 ± 7 nm in TL205 (Fig. 2E). We hypothesized that the nanofiber diameter is controlled by
the extrapolation length ξ, which is defined as K/W, where K is the average Frank elastic
constant for splay and bend of the LC and W is the surface anchoring energy density. If an
inclusion (here, nanofiber) in a LC grows to a size that exceeds ξ, the
orientation-dependent interfacial energy associated with interaction of the LC with the
surface of the inclusion (WD², where diameter D is the inclusion size) exceeds the
energetic cost of elastic deformation of the LC (KD), and the LC will elastically deform
around the inclusion (18, 19). We tested the hypothesis that D = K/W by using literature
values of K at temperature (T) = 25°C (18, 20) and our experimental values of D to
calculate W (Fig. 2E). Calculated values of W were ~10⁻⁴ J/m² (20), with
W_TL205 > W_E7 > W_5CB, a ranking that is consistent with (i) the theoretical prediction
that W ∝ (T_NI - T)^(2β), where the material constant β = 0.4 to 0.5 (22, 23) and T_NI is
the nematic-isotropic phase-transition temperature of the LCs (Fig. 2E), and
(ii) independent experiments that measured the relative values of W of these LCs
(fig. S5). We also found the average diameters of the nanofibers to depend on the
temperature of the LC during CVP. For example, D increased with T (Fig. 2E and fig. S6), as
theoretically predicted by ξ = K/W ~ (T_NI - T)^(-β) (22, 23). Overall, these results are
consistent with a mechanism of growth in which the elastic energy of the LC defines the
nanofiber diameter (via ξ) and promotes preferential growth of the nanofibers along the
alignment direction of the LCs.
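
The estimate of W from D = K/W can be reproduced in a few lines. Balancing the interfacial energy (WD²) against the elastic cost (KD) gives D = K/W, so W = K/D. The sketch below uses the nanofiber diameters reported above; the elastic constant is NOT taken from the article (the literature values appear only in the Fig. 2E inset) and is set to a single order-of-magnitude placeholder of 10 pN for all three LCs, so the output illustrates scale only. Variable and function names are hypothetical.

```python
# Mean nanofiber diameters reported in the text (nm).
DIAMETERS_NM = {"5CB": 142.0, "E7": 85.0, "TL205": 69.0}

# Placeholder Frank elastic constant (N). This is an assumed order of
# magnitude for nematic LCs, not a value from the article; the real
# calculation uses a different literature K for each LC.
ASSUMED_K_NEWTON = 1.0e-11


def anchoring_energy_density(k_newton: float, diameter_nm: float) -> float:
    """Surface anchoring energy density W = K / D, in J/m^2."""
    diameter_m = diameter_nm * 1e-9
    return k_newton / diameter_m


W_ESTIMATES = {
    lc: anchoring_energy_density(ASSUMED_K_NEWTON, d)
    for lc, d in DIAMETERS_NM.items()
}

if __name__ == "__main__":
    for lc, w in W_ESTIMATES.items():
        print(f"{lc:6s} D = {DIAMETERS_NM[lc]:5.1f} nm  W ~ {w:.2e} J/m^2")
```

With K fixed at 10 pN, the three estimates fall between about 0.7 × 10⁻⁴ and 1.5 × 10⁻⁴ J/m². Because the same K is used for every LC here, the ordering of W in this sketch simply mirrors the inverse ordering of D and should not be read as an independent confirmation of the reported ranking.
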

We performed CVP of 1a using homeotropically oriented E7 films with thicknesses ranging
from 5 to 22 µm and found that the lengths of the fibers closely matched the LC film
thicknesses (Fig. 2F). The result confirms growth of the nanofibers along the LC (figs. S7
and S8). We also found that monomers with a wide range of chemical functional groups could
be polymerized in LCs by CVP (Gorham process), yielding (i) ethynyl-functionalized
nanofibers for click-based reactions with azide derivatives (2b, Fig. 2G), (ii) nanofibers
that simultaneously present ethynyl and hydroxyl groups for reaction with azides and
activated carboxylic acids (2c, Fig. 2H), (iii) nanofibers without functional groups (2d,
Fig. 2I) as a nonreactive reference, (iv) polycationic pyridine-functionalized nanofibers
(2e, fig. S9A), and (v) water-repelling perfluoro-functionalized nanofibers (2f,
fig. S9B). Additionally, we used CVP into LCs to generate polymeric nanofibers with
distinct main chains, including (vi) biodegradable polyester nanofibers (2g, ROP process,
fig. S9C) and (vii) semiconducting poly(phenylene vinylene) nanofibers (2h, Gilch process,
fig. S9D). When templated by nematic TL205, nanofibers made of polymers 2b to 2h were
morphologically similar (Fig. 2, G to I, and fig. S9, A to D) and possessed infrared
spectroscopic signatures of the constituent chemical groups (Fig. 2, J to L, and fig. S9,
E to H).

Fig. 2. Templated CVP of nanofibers with precise lengths, diameters, and surface
chemistries. (A to D) SEM images of nanofibers formed by CVP of 1a (10 mg) with the
indicated templates (see insets): crystalline 5CB at 13°C (A), nematic 5CB at 25°C (B),
isotropic 5CB at 37°C (C), and isotropic silicone oil at 25°C (D). The LC thickness was
21.7 ± 0.5 µm. (E) Nanofiber diameters (left axis) obtained by CVP of 1a (6 mg) into E7 at
13°C (down triangle), 25°C (circle), 30°C (up triangle), and 5CB and TL205 at 25°C
(circles). Red X's are the calculated surface anchoring energy densities (W, right axis)
for each LC at 25°C. The inset table shows elastic constants (K) at 25°C and the
nematic-isotropic phase transition temperatures (T_NI) of TL205, E7, and 5CB.
(F) Nanofiber length as a function of either nematic E7 film thickness (black points) or
mass of polymerized 1a for a LC film with thickness of 21.7 ± 0.5 µm (red points).
Mean ± SD, n = 10 measurements. (G to I) Representative SEM images of 2b (G), 2c (H), and
2d (I) templated into TL205. (J to L) Representative FTIR spectra of 2b (J), 2c (K), and
2d (L) templated into TL205. FTIR spectra of the nanofibers (red) are compared to polymer
films synthesized without the LC phase (blue). LCs were removed before imaging and FTIR
spectroscopy. a.u., arbitrary units.

Fig. 3. Influence of LC template on nanofiber morphology and organization. (A to D) The
left column shows optical micrographs (top view, crossed polars) of LC templates; insets
are schematic illustrations (side view) of molecular order within the LC templates. The
right two columns show SEM images of nanofibers templated from the LCs. (A) Nematic film of
E7 with hybrid anchoring and resulting banana-shaped nanofibers. (B) Homeotropically
oriented film of a smectic A LC phase and the resulting polymeric nanostructures.
(C) Micrograph showing cholesteric LC phase of E7 doped with a left-handed chiral dopant
(S-811). SEM images in middle and right columns show nanofibers templated from E7
containing left-handed (S-811) and right-handed (R-811) dopants, respectively. The black
and blue arrows in the inset indicate the helical axis and handedness of the twist,
respectively. (D) Blue phase LC (BP1) with a cubic lattice spacing of ~250 nm and the
resulting polymeric nanostructure. The inset in the far-right column shows a bundle of
helical nanofibers.

LC ordering within films is influenced by interactions with confining surfaces, LC elastic
moduli, and molecular properties of LCs, including chirality (19), thus offering access to
a diverse range of LC templates for CVP. For example, a film of nematic 5CB prepared with
planar and homeotropic anchoring at bottom and top LC surfaces, respectively, leads to a
bent and splayed internal ordering of the LC that templates banana-shaped nanofibers
(Fig. 3A). Smectic LCs, with a statistical layering of oriented molecules (19), templated
straight nanofibers with broadened tips arranged in conical fan-like morphologies (Fig. 3B)
(24). Chiral nematic phases (cholesteric) yielded shape-controlled and chiral nanofiber
assemblies with micrometer-scale periodicities (11.5 ± 1.5 and 11.3 ± 1.5 µm for S- and
R-templated nanofibers, respectively) and an organization consistent with the fingerprint
pattern characteristic of cholesteric films (periodicity of 10.8 ± 1.2 and 10.7 ± 1.1 µm
for S- and R-handed cholesteric phases, respectively; Fig. 3C and fig. S10). The chirality
of the LC also influenced the handedness of the nanofibers, as confirmed by circular
dichroism spectroscopy of surface-immobilized and solvent-dispersed nanofibers (figs. S11
and S12) (25). In contrast to all other LC phases, a three-dimensional network of helical
nanofibers was formed by CVP into blue-phase LCs, reflecting nanofiber growth templated by
a three-dimensional network of double-twisted LC and line defects (Fig. 3D) (26). Overall,
these results reveal that the shape, interfacial orientations, and morphologies of the
nanofiber arrays are templated by the structure of the LC phase. The LC transition
temperatures remain almost unaltered by CVP (fig. S13), consistent with our conclusion
(fig. S14) that the monomer concentration in the templating LC phase during nanofiber
formation is low. These findings differ from conventional polymerizations in LCs, in which
the dynamic interplay between the polymerization process and the LC template phase
behavior makes control of the resulting polymeric nanostructures difficult and often leaves
unreacted monomers in the sample (26-28).
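
The statement that the nanofiber periodicities are consistent with the cholesteric fingerprint periodicities can be checked with a simple overlap test. The sketch below asks whether two mean values differ by less than their combined uncertainty. It assumes that the ± values in the text are standard deviations and that all four periodicities are in micrometers; the criterion and the function name are illustrative and are not the authors' statistical analysis.

```python
import math


def agree_within(mean_a: float, sd_a: float,
                 mean_b: float, sd_b: float,
                 n_sigma: float = 1.0) -> bool:
    """True if the two means differ by at most n_sigma combined uncertainties.

    The combined uncertainty is the root sum of squares of the two
    individual uncertainties.
    """
    combined = math.hypot(sd_a, sd_b)
    return abs(mean_a - mean_b) <= n_sigma * combined


if __name__ == "__main__":
    # (nanofiber periodicity, cholesteric film periodicity), values in micrometers
    cases = {
        "S": ((11.5, 1.5), (10.8, 1.2)),
        "R": ((11.3, 1.5), (10.7, 1.1)),
    }
    for handedness, ((fm, fs), (lm, ls)) in cases.items():
        print(handedness, agree_within(fm, fs, lm, ls))
```

Both pairs differ by less than one combined uncertainty, in line with the fingerprint pattern acting as the template.
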
Figure 4, A to D, shows that CVP into con- 
formal films of nematic E7 formed over the 
outer or inner surfaces of a hollow cylinder 
yielded arrays of nanofibers (97.5 ± 17.5 nm in
diameter) anchored on the curved surfaces of 
the cylinder. On the inner surface, the density 
of the nanofibers decreased with increasing 
distance from the open end (Fig. 4D). Meso- 
scopic nanofiber islands were templated from 
micrometer-sized LC droplets electrosprayed 
onto surfaces, thus providing a scalable approach 
for fabrication of arrays (Fig. 4, E and F). We also 
found that free-standing LC films formed within 
metallic meshes or at the ends of capillaries 
(Fig. 4G) templated organized nanofiber assem- 
blies (Fig. 4, H and I). Additionally, microbeads 
dispersed in LC phases before CVP supported 
Fig. 4. Templated CVP of polymeric nanofibers in complex geometries. (A to D) CVP of 1a into
nematic E7 films coated on either the exterior (A) or interior (C) surfaces of glass capillaries
and SEM images [(B) and (D)] of corresponding nanofibers [(1) indicates the region closest to
the orifice and (2) indicates the region 1.5 mm from (1) inside the cylinder]. (E and F) CVP of 1a into
E7 microdroplets on a glass surface (E) and SEM images of the nanofiber assemblies (F). (G and
H) CVP of 1a into free-standing films of E7 hosted within a stainless-steel mesh (G) and SEM image
of suspended nanofiber film (H). (I) SEM image of a nanofiber membrane spanning the tip of a
glass capillary, which was initially intact but was opened during microscopy, revealing an ultrathin
nanofiber array. (J) Schematic illustration of two substrates coated with nanofiber arrays prepared
by CVP. (K) Adhesion forces between pairs of substrates decorated by flat CVP films (F) or nanofiber
arrays templated from nematic (N), left-handed cholesteric (S) or right-handed cholesteric
(R) LC phases. Data are means ± SD; n = 5 measurements. Statistical analyses were performed
between groups using Tukey's test; * indicates statistically identical results, P > 0.4; ** indicates
statistically different results, P < 0.002.


growth of nanofibers, and copolymerization was 
used to create nanofibers for coimmobilization 
of different biomolecules (fig. S15). 

Overall, our results reveal that CVP into LC 
templates enables scalable fabrication of arrays 
of polymeric nanofibers with programmable 
shapes, chemistries, and long-range lateral or- 
ganization, yielding interfacial properties cur- 
rently unavailable by other techniques. For 
example, we found that it was possible to ma- 
nipulate nanofiber chirality to control adhe- 
sion between surfaces. As shown in Fig. 4, J 
and K, we measured adhesion to be higher for 
surfaces decorated with nanofibers than the 
corresponding flat CVP films, with the relative 
chirality of the nanofibers presented by the two 
surfaces also influencing the magnitude and 
selectivity of adhesion (Fig. 4K). Other properties
designed into nanofiber arrays prepared by
LC-templated CVP include wettability, intrinsic 
photoluminescence, biodegradability, and sur- 
face charge (Fig. 4, J and K, and fig. S16). We 
envisage that additional functional properties 
can be realized by exploiting the full diversity 
of LC templates (17, 19) along with other polym- 
erization mechanisms using vapor-phase deliv- 
ery of monomers. 
Reversible self-assembly of
superstructured networks
Nicholas Stephanopoulos, Mark T. McClendon, Cheyenne Lynsky,
Soft structures in nature, such as protein assemblies, can organize reversibly into
functional and often hierarchical architectures through noncovalent interactions. 
Molecularly encoding this dynamic capability in synthetic materials has remained an 
elusive goal. We report on hydrogels of peptide-DNA conjugates and peptides that organize 
into superstructures of intertwined filaments that disassemble upon the addition of 
molecules or changes in charge density. Experiments and simulations demonstrate that 
this response requires large-scale spatial redistribution of molecules directed by strong 
noncovalent interactions among them. Simulations also suggest that the chemically 
reversible structures can only occur within a limited range of supramolecular cohesive 
energies. Storage moduli of the hydrogels change reversibly as superstructures form and 
disappear, as does the phenotype of neural cells in contact with these materials. 


Nature exploits self-assembly processes to
promote the formation of highly organized
structures in a hierarchical manner (1, 2).
These structures often reorganize dynamically
as interactions among their constituents
change, which affects their functions (3-5).
The design of weak and reversible interactions 
between molecules provides, in principle, a strat- 
egy to synthesize supramolecular architectures 
that can rearrange dynamically to impart changes 
in functionality. Despite recent advances in creat- 
ing artificial hierarchical systems through self- 
assembly (6-10), approaches to manipulate these 
structures reversibly across length scales that 
reach macroscopic dimensions remain elusive. 
Collagen-mimetic peptides that form hierarchi- 
cal structures have been designed in which triple 
helices of molecules interact to create fibrillar 
networks (11). However, these structures are neither
tunable nor reversible. Another relevant recent 
example demonstrated dynamic changes in the 
unit cell of a microscopic colloidal crystal, in 
which gold nanoparticles were spatially recon- 
figured through chemically driven changes in 
surface organic ligands (12). Synthetic bundled
fibrous networks with the dimensional tunabil- 
ity and dynamic reversibility of collagen would 
greatly enhance our ability to design functional 
soft matter. 

We report on fibrous supramolecular networks 
that form reversible superstructures controlled 
externally by the addition of soluble molecules. 
The system consists of nanofibers formed by co- 
assembly of alkylated peptides (monomer 1) with 
a similar monomer containing a covalently linked 
oligonucleotide terminal segment (monomer 2; 
see fig. S1). Mixing 1 with 2 in molar concen- 
trations ranging from 0.1% to 10% led to the 
formation of fibers with a stochastic distribution
of monomers along their length (fig. S2). The
original objective of this work was to create 
hydrogels in which small amounts of comple- 
mentary oligonucleotides in separate fibers would 
lead to reversible cross-linking through Watson- 
Crick base pairing (Fig. 1A). When we mixed an 
aqueous solution containing fibers with com- 
plementary oligonucleotides (1/2 and 1/2’, tables 
S1 and S2), we observed the expected formation 
of a gel that could be liquefied by adding a soluble 
single-stranded DNA that breaks the cross-links 
via the well-known toehold-mediated strand dis- 
placement (13) (fig. S3). However, we were sur- 
prised to find by scanning electron microscopy 
(SEM) a superstructure in which large micrometer- 
sized bundles of fibers segregated within a network 
of individual fibers (Fig. 1B and fig. S4). Small-angle 
x-ray scattering (SAXS) also confirmed the for- 
mation of higher-order structures (fig. S5). 

To investigate possible differences in compo- 
sition between the two apparent phases, we 
labeled oligonucleotides with a fluorescent dye 
(Cy3) to probe their distribution in the hydrogel. 
Confocal optical microscopy revealed that most 
of the DNA-containing monomers were concen- 
trated within the bundled regions (Fig. 1C). We 
first hypothesized that the system contained 
supramolecular polymers differing in content of
DNA-bearing monomers, which in turn spatially 
segregated to create the bundled regions. How- 
ever, we gelled solutions containing fibers with 
either monomer 2 or monomer 2’ by adding cal- 
cium chloride (electrostatic cross-linking) and did 
not find any domains with concentrated fluores- 
cence characteristic of the bundled regions (fig. S6). 

We then considered whether the formation of 
bundled regions involved large-scale spatial re- 
distribution of monomers within and among the 
fibers. Stochastic optical reconstruction microscopy
has revealed such dynamic exchange of monomers
in supramolecular copolymers (14). To confirm
that DNA hybridization among neighboring fibers 
was involved in the formation of the bundles, 
we mixed aqueous solutions of fibers containing 
noncomplementary oligonucleotides, which did 
not yield bundled structures (fig. S7A). 

To establish that large-scale redistribution of 
monomers can give rise to bundle formation in a 
network of fibers, we carried out coarse-grained 
molecular dynamics simulations using a model 
that accounts for the hybridization of comple- 
mentary DNA segments (fig. S8 and tables S3 
and S4). In the simulation, each fiber is a chain 
of overlapping spheres that represent peptide 
amphiphile (PA) monomers, and some of the 
monomers are randomly grafted with DNA side 
chains. Complementary side chains can hybrid- 
ize by forming reversible bonds while dynamic 
exchange of molecules among fibers is either 
disabled or permitted (by fixing monomers within 
the fibers or allowing them to be mobile, respec- 
tively). A detailed description of the model and 
the simulation procedure is provided in the sup- 
plementary materials. Snapshots (Fig. 1, D and E) 
show that dynamic molecular exchange among 
the supramolecular polymers is essential for the 
formation of DNA-rich bundles. Förster resonance
energy transfer (FRET) experiments on mixtures 
of assemblies containing complementary DNA 
strands and labeled with either a donor or an 
acceptor moiety confirmed that monomers from 
the two separate fiber populations exchange and 
hybridize (fig. S9). 
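
The bookkeeping behind the two simulation settings described above (monomers fixed within fibers versus free to exchange) can be illustrated with a minimal Python sketch. It is not the authors' coarse-grained molecular dynamics model: it has no spatial coordinates, forces, or hybridization energetics, and every name in it (Fiber, build_fibers, exchange_step, total_dna) is a hypothetical choice made for illustration. It only shows fibers as chains of monomers with randomly grafted DNA, plus a switch that permits or forbids swapping monomers between fibers.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Fiber:
    """A fiber as an ordered chain of monomers; True marks a DNA-grafted monomer."""
    has_dna: list = field(default_factory=list)


def build_fibers(n_fibers, monomers_per_fiber, dna_fraction, rng):
    """Create fibers in which each monomer carries DNA with probability dna_fraction."""
    return [
        Fiber([rng.random() < dna_fraction for _ in range(monomers_per_fiber)])
        for _ in range(n_fibers)
    ]


def exchange_step(fibers, rng, exchange_allowed):
    """Swap one randomly chosen monomer between two randomly chosen fibers.

    With exchange_allowed=False the monomers stay fixed, mimicking the
    'exchange disabled' setting. The swap rule is a placeholder (assumption)
    and carries no energetics.
    """
    if not exchange_allowed or len(fibers) < 2:
        return
    a, b = rng.sample(fibers, 2)
    i = rng.randrange(len(a.has_dna))
    j = rng.randrange(len(b.has_dna))
    a.has_dna[i], b.has_dna[j] = b.has_dna[j], a.has_dna[i]


def total_dna(fibers):
    """Count DNA-grafted monomers over all fibers."""
    return sum(sum(f.has_dna) for f in fibers)


if __name__ == "__main__":
    rng = random.Random(0)
    fibers = build_fibers(n_fibers=4, monomers_per_fiber=50, dna_fraction=0.05, rng=rng)
    before = total_dna(fibers)
    for _ in range(1000):
        exchange_step(fibers, rng, exchange_allowed=True)
    # Exchange moves monomers between fibers but never creates or destroys them.
    print("DNA monomers conserved:", before == total_dna(fibers))
```

In a real model the swap would be accepted or rejected according to the interaction energies, which is where enrichment of DNA monomers in bundles would arise; the sketch leaves that part out.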

The simulations also provide important in- 
sights into the mechanism and kinetics of bundle 
formation. From a kinetic point of view, a hybrid- 
ization event between fibers is likely to facilitate 
additional cross-linking locally of other DNA 
segments in neighboring locations. Furthermore, 
hybridized monomers have a lower tendency 
to escape to other fibers, measured in the simu- 
lations as “trapping time” (fig. S10A). Likewise, 
the diffusivity of DNA monomers decreased once 
they were recruited into the incipient bundles of 
the superstructure (fig. S10B). We infer that such 
mechanisms should lead to the growth of stable 
bundled regions. In experiments using monomers 
labeled with the cyanine dye Cy3, we followed the 
kinetics of bundle formation and found that 
micrometer-scale bundles formed within 10 min 
(fig. S9, A and B). 

The simulations also showed that the bundle 
growth rate (fig. S11) is sensitively controlled 
by the relative strength of molecular attraction 
among monomers within the fiber versus the
energy associated with hybridization. Molecular
attraction within the fibers is controlled by
the energy associated with β-sheet formation and
hydrophobic collapse of aliphatic segments in PA
molecules (E_intra), whereas interaction between
fibers is mediated by hybridization energy (E_inter).
Additionally, the simulation predicted that fiber
bundles form through redistribution of monomers
when E_intra lies within the remarkably narrow
range of 5 to 10 k_BT, where k_BT is the thermal
energy (the product of the Boltzmann constant
k_B and temperature T) (Fig. 1F). Thus, cohesion
among molecules needs to be strong enough to
create stable fibers but not so strong as to prevent
dynamic exchange. Within this range, the model
also showed that interfiber cross-linking requires
a threshold energy to create bundled regions
(E_inter > 5 k_BT; fig. S12A). Below this threshold,
the DNA monomers are predicted to distribute 
randomly along fibers, resulting in a homoge- 
neous disorganized structure (fig. S12B). By ex- 
plicit estimation of the free-energy differences 
(see supplementary materials), we confirmed that 
the molecular design of monomers 2 and 2’ 
indeed fell in the predicted regime for bundle 
formation. When dynamic exchange was suppressed
(E_intra > 10 k_BT), very small bundles could
still form, provided that cross-links could break 
and rehybridize (fig. S12C). As redistribution of 
monomers does not occur, the growth rate is 
naturally limited by the low density of DNA monomers
in the fibers. Additional experiments varying
E_intra and E_inter using different molecules
supported our computational predictions and 
demonstrated the experimental tunability of the 
system investigated (figs. S13 to S18). 
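
To make the energy windows in this paragraph concrete, the following Python sketch encodes the quoted thresholds (E_intra between 5 and 10 k_BT, E_inter above 5 k_BT) as a simple lookup and converts k_BT to kJ/mol. The function names and regime labels are illustrative assumptions, and the label for E_intra below 5 k_BT is only our reading of the statement that cohesion must be strong enough to create stable fibers. It is a summary of the stated thresholds, not a substitute for the simulations.

```python
"""Toy lookup for the bundling regimes quoted in the text (energies in k_B*T)."""

GAS_CONSTANT_J_PER_MOL_K = 8.314462618  # R = N_A * k_B


def kbt_to_kj_per_mol(energy_kbt: float, temperature_k: float = 298.15) -> float:
    """Convert an energy expressed in k_B*T to kJ/mol at the given temperature."""
    return energy_kbt * GAS_CONSTANT_J_PER_MOL_K * temperature_k / 1000.0


def predicted_regime(e_intra: float, e_inter: float) -> str:
    """Return a coarse label for the structure expected at (E_intra, E_inter)."""
    if e_intra < 5.0:
        # Assumption: inferred from "strong enough to create stable fibers".
        return "fibers not stable"
    if e_intra > 10.0:
        # Text: exchange suppressed; very small bundles may still form.
        return "exchange suppressed: at most very small bundles"
    if e_inter <= 5.0:
        # Text: below the threshold, DNA monomers distribute randomly.
        return "homogeneous network, no bundles"
    return "bundles form by monomer redistribution"


if __name__ == "__main__":
    for e_intra, e_inter in [(3, 8), (7, 3), (7, 8), (12, 8)]:
        print(e_intra, e_inter, "->", predicted_regime(e_intra, e_inter))
    print(f"5 k_BT at 298.15 K = {kbt_to_kj_per_mol(5):.2f} kJ/mol")
```

At room temperature 1 k_BT is roughly 2.5 kJ/mol, so the 5 to 10 k_BT window corresponds to about 12 to 25 kJ/mol per monomer.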

We also explored, both experimentally and 
through simulations, the effect of molar concen- 
tration of DNA monomers on bundle formation 
(figs. S7, B to D, and S19). When DNA densities 
were too low, few fibers were cross-linked and 
the resulting monomer redistribution was not 
sufficient to support appreciable bundling. As 
the density increased, the clustering of DNA- 
containing monomers drove formation of larger 
bundles. Above a given DNA concentration, the 
system “froze” kinetically into a three-dimensional 
(3D) gel without any bundled structures. 

Having obtained evidence that the super- 
structures form when the systems contain low 
amounts of DNA-containing monomers, we were 
interested in investigating supramolecular as- 
semblies in which all of the molecules are func- 
tionalized with complementary oligonucleotides. 
These systems would experimentally mimic the 
final DNA-rich superstructures created dynam- 
ically in the hydrogels. We followed the time 
evolution of these systems using electron micros- 
copy and discovered that both DNA-containing 
monomers in pure form self-assembled into spheri- 
cal micelles (fig. S20, A and B). In our view, this 
result is not surprising given the large size and 
charge of the DNA segments. However, when 2 
and 2’ were mixed and annealed, the spherical 
micelles metamorphosed into large twisted bundles 
of fibers (fig. S20, C and D). These structures 
resembled the bundles observed in the hydrogels
formed by co-assembled fibers in which DNA was 
only present in a small percentage of the mono- 
mers. This result implies that the drastic shape 
transformation from micelles to filaments was 
driven by DNA hybridization. 

We then considered the role
of charge in the formation of such structures and
designed monomers 3 and 3’ (tables S1 and S2), 
which contained complementary shorter DNA 
sequences that would experience weaker elec- 
trostatic forces. In these systems, we observed 
the formation of similar filamentous structures 
starting from spherical aggregates (fig. S21). To 
reduce electrostatic interactions further, we synthesized
monomer 4 lacking the charges associated
with nucleotides by replacing DNA with
a peptide nucleic acid (PNA) sequence (table S1
and fig. S1) while keeping hybridization energy
constant, as confirmed by the melting temperature
(table S2). We found that monomer 4 self-assembled
into filaments (fig. S22), which indicates
that charge density is an important factor in the 
formation of spherical aggregates. We then com- 
bined monomer 4 with a complementary DNA- 
containing monomer 4’. Much to our surprise, 
within 24 hours after mixing 4 and 4’, we ob- 
served formation of pairs of intertwining fibers 
with a regular pitch (Fig. 2A). As solutions were 
allowed to age further (5 and 7 days), we dis- 
covered further growth of twisted structures 
containing many fibers. These results suggest 
that the pairs formed at early time points con- 
tained nonhybridized oligonucleotide segments, 


Fig. 1. Dynamics in DNA-peptide amphiphiles drives the formation of hierarchical structures.
(A) Illustration of peptide amphiphile fibers cross-linked by DNA hybridization; fibers are shown
in their initial state prior to monomer exchange. (B) SEM micrograph of the hydrogel formed
upon DNA cross-linking. Two populations within the gel are shown, consisting of twisted bundles
(diameter ~1 to 3 μm) and single fibers (diameters between 10 and 15 nm). (C) Confocal
reconstruction image of a section of the gel containing DNA monomers modified with the
fluorescent dye Cy3. Bundles are shown in purple. (D) Simulation snapshots showing a
homogeneous hydrogel when molecular exchange of DNA monomers between peptide
amphiphile fibers is prohibited. Magnified view shows individual fibers (blue) with a stochastic
distribution of DNA monomers (pink) along the fibers. (E) Simulation snapshots showing the
emergence of bundles of fibers when molecular exchange is allowed. Magnified view shows
bundle of fibers (blue) enriched with DNA (pink) in a matrix of individual fibers depleted of DNA
monomers. (F) Bundle growth rate as a function of intra- and interfiber energies (E_intra, E_inter).
Bundles form within the energy range 5 k_BT < E_intra < 10 k_BT (black arrows).
which created attachment points that then allowed
further growth of the intertwined bundles.

To investigate the mechanism of intertwin- 
ing, we simulated the interaction between com- 
plementary PNA and DNA filaments meeting 
at an arbitrary angle (Fig. 2B). The simulation 
showed that oligonucleotides hybridize first 
at the contact point, rapidly followed by fur- 
ther hybridization events as the fibers bend 
around each other to create an intertwined 
pair (movie S1). Because the intertwined state 
requires bending of the fibers (~1 k_BT/nm),
we hypothesized that the observed structure is 
thermodynamically less favorable than hybridization
between two parallel fibers. However, to
achieve parallel arrangement, the intertwined
structure faces an enormous energy barrier
(>10 k_BT/nm) involving the breaking (and subsequent
reforming) of hybridized oligonucleotides.
This was confirmed by a free-energy analysis
(fig. S23) and by the observation that the twist 
state of two complementary fibers was deter- 
mined by their initial contact angle (fig. S24). 
When deformation of the soft fibers was taken
into account, the degree of hybridization in- 
creased (thereby raising the free-energy bar- 
rier), but parallel alignment remained favorable 
(fig. S25). Thus, we concluded that the observed 
intertwined structure is likely a kinetically trapped 
state. Future atomistic simulations may reveal 
that the intertwined architecture can be driven 
by the nature of intermolecular packing within 
the supramolecular polymer. The simulation also 
showed that intertwined pairs display a relatively 
uniform pitch of approximately 300 nm, consistent 
with our experimental observations. In fact, the 
pitch saturated at a constant value for most initial 
contact angles between fibers (>25°; Fig. 2B) and 
sufficiently high DNA densities (>30%; Fig. 2C). 
However, the saturated pitch could be controlled 
by varying DNA length (Fig. 2D), fiber stiffness 
(fig. S26), or oligonucleotide type (DNA or PNA; 
fig. S27). 

The work described above in solutions con- 
taining complementary filaments provided us 
with mechanistic insight into the origin of bundle 
Fig. 2. Programming the growth of intertwined bundles of fibers. (A) Transmission electron
microscopy images after mixing complementary DNA- and PNA-terminated peptide amphiphiles
show the time-dependent evolution of twisted bundles over 24 hours, 5 days, and 7 days.
(B) Simulation snapshot of two intertwined complementary fibers. The intertwining pitch
saturates for most initial contact angles (inset, bottom left). Hybridized DNA-PNA pairs between
the two fibers (magnified view) form a twisted ribbon pattern. (C) Dependence of the pitch
on the fraction of monomers with oligonucleotides. Simulation snapshots are shown for systems
with 0.4%, 4%, and 40% oligonucleotide-modified monomers. (D) Dependence of the
pitch on the length of oligonucleotides. Simulation snapshots are shown for duplexes with
10, 25, and 40 DNA-PNA base pairs (bp).
formation in hydrogels. The superstructure observed
in the hydrogels containing fiber bundles
can be viewed as a hierarchical structure with 
multiple levels of molecular organization. The 
first level of structure involves the interactions 
leading to filament formation (hydrogen bond- 
ing and hydrophobic collapse), followed by inter- 
twining of fibers through DNA hybridization as 
a second level of structure. At even larger length 
scales in the hierarchical structure, bundle for- 
mation occurs via further hybridization among 
multiple intertwined fiber pairs, which then twist 
collectively. This description of the hierarchical 
structure is consistent with the large bundles dis- 
persed in a matrix of DNA-depleted PA nanofibers 
as shown in Fig. 1B. 

Given the possibility of melting interfiber DNA 
duplexes or breaking them using a competitive 
single-stranded oligonucleotide, we proceeded to 
investigate the reversibility of the hierarchical 
structure. First, we tested the effect of temper- 
ature and found that bundle-containing hydro- 
gels could be liquefied at 95°C. We then rapidly 
fixed the structure at elevated temperature by 
electrostatic gelation with calcium chloride and 
analyzed its structure by SEM (fig. S28). In sam- 
ples treated this way, large bundles completely 
disappeared and only a network of individual 
fibers was visible (fig. S28B). In contrast, when 
the liquefied hydrogel was cooled slowly and 
imaged by SEM, superstructures reformed (fig.
S28A). We infer that monomers once again redistributed
in space, hybridized, and recreated
the bundles. To further probe the thermal melt- 
ing of bundles, we performed SAXS experiments, 
which indicated that at 95°C the fiber morphol- 
ogy persisted but the hierarchical bundling did 
not (fig. S29). 

We also investigated the use of a toehold- 
mediated strand-displacement mechanism to 
destroy interfiber DNA duplexes. Monomers 2 
and 2’ were designed to have an overhang se- 
quence that is not complementary. Thus, adding 
an “invader” oligonucleotide that is fully com- 
plementary to monomer 2 should reverse inter- 
fiber hybridization events. After simply adding a 
drop of solution containing the invader mole- 
cules to the hydrogel, we observed the complete 
disappearance of the bundled structures (fig.
S28D). The invader strand also contained a short
overhang sequence, which, upon addition of an 
anti-invader (fully complementary to the invader 
strand), allowed the hierarchical structures to 
reform (fig. S28C). 
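
The sequence logic of this strand-displacement design can be sketched in a few lines of Python. The sequences below are hypothetical placeholders, not those in tables S1 and S2, and for simplicity only strand 2 carries an overhang here. The point is that a partner complementary to the binding domain alone forms a shorter duplex with strand 2 than an invader complementary to the whole strand, overhang included, which is why the invader is expected to win the competition.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the Watson-Crick reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def longest_duplex(strand: str, partner: str) -> int:
    """Length of the longest contiguous antiparallel duplex between two strands.

    Computed as the longest common substring of `strand` and the reverse
    complement of `partner` (brute force; adequate for short oligonucleotides).
    """
    target = reverse_complement(partner)
    best = 0
    for i in range(len(strand)):
        for j in range(len(target)):
            k = 0
            while (i + k < len(strand) and j + k < len(target)
                   and strand[i + k] == target[j + k]):
                k += 1
            best = max(best, k)
    return best


# Hypothetical sequences (assumptions for illustration only).
TOEHOLD = "ACGTTC"                              # overhang left unpaired in the cross-link
DOMAIN = "GACTCATAGCCA"                         # segment that pairs with the partner
STRAND_2 = TOEHOLD + DOMAIN                     # oligonucleotide on monomer 2
STRAND_2_PRIME = reverse_complement(DOMAIN)     # partner on monomer 2'
INVADER = reverse_complement(STRAND_2)          # fully complementary to strand 2

if __name__ == "__main__":
    print("cross-link duplex length:", longest_duplex(STRAND_2, STRAND_2_PRIME))
    print("invader duplex length:   ", longest_duplex(STRAND_2, INVADER))
```

The same comparison applies one step further along: an anti-invader fully complementary to the invader, including the invader's own overhang, would form a longer duplex with it than strand 2 does.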

The observed hierarchical structures appear to 
be chemically reversible by adding molecules or 
through changes in temperature. By adjusting the 
stoichiometry of invader oligonucleotides, we could 
form intermediate structures with small rather 
than large fiber bundles (fig. S30). The reversible 
transformation from bundled structures to in- 
dividual fiber networks also led to reversible 
changes in the bulk mechanical properties of the 
hydrogels (see fig. S30). Hydrogels with super- 
structures had bulk storage moduli that were 
15 times those of materials containing individ- 
ual fiber networks. Furthermore, atomic force 
microscopy (AFM) nano-indentation studies confirmed
that the bundled fibers were stiffer than
individual fibers by a factor of ~6 on average 
(fig. S31). 

The coarse-grained rather than atomistic 
nature of our simulations suggested that the 
observed phenomena should not be limited to 
oligonucleotides and could be encoded in other 
systems without the use of DNA chemistry. For 
this purpose, we designed various PA sequences 
(5-7) each containing at their termini two op- 
positely charged peptide domains (Fig. 3, table 
S1, and fig. S32). We reasoned that electrostatic 
interdigitation of such “sticky ends” would mimic 
DNA duplex formation (Fig. 3A). Co-assembly of 
these monomers with 1 yielded bundles of inter- 
twined fibers similar to those in DNA-containing 
systems (Fig. 3, B to D). Longer sequences of both 
charged residues and spacers resulted in greater 
bundle dimensions (Fig. 3, B to D). When pH was 
either raised or lowered by adding NaOH or HCl, 
the bundles disappeared owing to the lack of 
electrostatic complementarity (figs. S33 to S35). 
However, simply mixing two different fibers bear- 
ing oppositely charged peptide domains did not 
result in bundle formation (fig. S36). This differ- 
ence most likely arose because the fibers were 
kinetically trapped by electrostatic forces in a 3D 
gel. Alternatively, monomer exchange could reduce 
the thermodynamic driving force for bundling 
by mixing oppositely charged monomers on in- 
dividual fibers. 

The system investigated here has structural 
features that are biomimetic of mammalian ex- 
tracellular matrices (ECMs), a physical space that 
is known to undergo constant remodeling (15, 16). 
In natural ECMs, the networks of fibers vary 
widely in their organization and stiffness depending
on the tissue (17). Often, these features are
controlled by the extent of bundling of fibers. 
Because our experimental ECM mimic effectively 
remodels reversibly upon addition of a water- 
soluble and biocompatible molecule, we chose to 
investigate how dynamic organization of fibers 
within a hydrogel network affects cells in culture. 
We selected cortical astrocytes from the central 
nervous system (CNS) for these experiments be- 
cause they are subjected to a changing matrix 
environment after injury to the brain or spinal 
cord, yet much remains to be learned about how 
these changes affect their behavior. In this injury 
environment, astrocytes become reactive, under- 
going drastic morphological changes and up- 
regulating glial fibrillary acidic protein (GFAP) 
and vimentin (18, 19), a process known as astrogliosis.
The glial scar after injury to the CNS is
spatiotemporally dynamic and contains a variety 
of macromolecules, including collagens, laminins, 
fibronectin, and proteoglycans among others. 
The glial scar contains increased concentrations 
of fibrillar collagen type I, an ECM component 
not usually found in the normal brain, which is 
composed of nonfibrillar collagen IV and glyco- 
saminoglycans (20, 21). 

Because our system can mimic aspects of 
the morphological changes in the brain micro- 
environment, we used it as a culture substrate 
Fig. 3. Programming hierarchical structures with a peptide code. (A) Molecular graphics
representation of the complementary interactions between the DNA (top) and DNA-mimetic peptide
amphiphiles (bottom), and the corresponding morphologies of bundled fibers observed in both
systems by SEM. (B to D) SEM micrographs of bundled and twisted fiber morphologies of
varying diameters: (B) 140.5 ± 15 nm, (C) 332 ± 37 nm, and (D) 905 ± 190 nm. Also shown are the
corresponding dimer molecular graphics and chemical sequences of the DNA-mimetic peptide
amphiphiles that form the superstructures (Cy. is the number of carbons in the aliphatic terminus of
the amphiphiles; eg, ethylene glycol). Quantification of bundle diameters used a minimum of 15
randomly selected images (taken from three independent batches) for each system.


for cortical astrocytes isolated from postnatal 
mice. We tested the two states of the system— 
one with the superstructures consisting of bundled 
fibers and the other containing only individual 
fibers—and switched from one to the other by 
adding the invader strand. Figure 4A shows 
confocal micrographs of the cells labeled with 
GFAP and the nuclear stain 4’,6-diamidino-2- 
phenylindole (DAPI) after 10 days in culture. To 
our surprise, astrocytes cultured on bundled fiber 
(BF) hydrogels developed a reactive morphol- 
ogy and up-regulation of GFAP and vimentin 
(see Western blot data in Fig. 4, B to D), whereas 
those cultured on individual fiber (IF) substrates 
had the naive morphology observed under con- 
trol conditions (glass) and lacked overexpression 
of both proteins. As a positive control, we added 
dibutyryl cyclic adenosine monophosphate, which 
is well known to induce the reactive phenotype 
of astrocytes (22, 23). The data show that the re- 
sulting phenotype was similar to the one achieved 
on BF substrates (fig. S37). In addition, cells with 
the reactive phenotype were observed to up- 
regulate phospho-histone 3 (PH3), a marker of 
cell proliferation (Fig. 4, B to E). Proliferation 
was also demonstrated through staining with 
5-ethynyl-2’-deoxyuridine (fig. S38), which is only 
incorporated into actively dividing cells. Fur- 
ther confirmation of the reactive phenotype on 
BF substrates was provided, as expected, by an 
increase in reactive oxygen species (ROS) (Fig. 4F 
and fig. S39) (24, 25). 

We then considered whether changes in phenotype
were linked to differences in mechanical
properties between BF and IF substrates. However,
cells exhibited the naive phenotype when cultured 
on non-DNA-containing hydrogels formed by 
self-assembly of monomer 1, which has a bulk 
modulus similar to that of the BF structure (fig.
S40). Although these hydrogels had similar bulk
moduli, stiffness could be a factor in the phenotypic
change observed because AFM revealed
that the superstructures were locally stiffer than 
individual fibers (fig. S31). However, stiffness can- 
not be the sole factor in the observed behavior, 
because cells cultured on glass (obviously a very 
stiff substrate) also exhibited the naive pheno- 
type. Moreover, glial scars where the reactive 
phenotype of astrocytes is observed are actually 
softer rather than stiffer relative to the normal 
CNS environment (26). Our results therefore sug- 
gest that the structural organization of the newly 
formed extracellular matrix after injury elicits 
astrocyte activation, and that this phenomenon 
is reversible if the matrix environment reverts 
back to the pre-injury structure. 

Having established the two distinct cell pheno- 
types on BF and IF substrates, we tested the 
response of the cells to the chemical reversibility 
of the artificial matrix from one state to the other 
by addition of the invader strand. Cells were cul- 
tured for 5 days on BF and IF substrates. At the 
end of this period, we added solutions of the 
invader and anti-invader strands to morphologi- 
cally remodel the matrix. Five days later, cells had 
switched from the reactive to the naive pheno- 
type when the invader strand was added to BF 
substrates, and from naive to reactive when the 
anti-invader strand was added to IF scaffolds. 
As indicated in Fig. 4, A to F, these changes in 
phenotype driven by dynamic changes of the 
substrate were accompanied by variation in pro- 
tein expression and ROS. Figure 4G shows SEM 
images revealing the morphological differences 
between reactive and naive astrocytes on bundled 
“terrain” versus single-fiber matrices. Moreover, 
in the case of BF substrates, cells appeared to 
interact closely with the bundles. Although as- 
trogliosis was thought to be unidirectional and 
irreversible, glial cells transplanted from an injured 
Fig. 4. Modulating the phenotype of astrocytes on reversible hierar-
chical ECM mimetic. (A) Confocal microscopy images of astrocytes
plated on individual fibers (left), on bundled fibers (center), and after 
switching from bundles to individual fibers (right). Staining for GFAP 
(green) and cell nuclei (DAPI, blue) reveals cells with naive morphology on 
substrates of individual fibers and reactive morphology on substrates of 
bundled fibers. Scale bar, 50 µm. (B) Western blot analysis of protein
expression (related to cytoskeleton and cell proliferation) in astrocytes on 
indicated substrates. (C to E) Relative expression of proteins derived from 


spinal cord to an uninjured one (known to be 
stiffer than the injured one) reverted from the 
reactive to the naive phenotype (20), which sug- 
gests that architectural cues and not matrix stiff- 
ness can reversibly control astrogliosis. Future 
therapeutic strategies that “defibrillate” glial scars 
could be explored to reverse neural pathologies 
through astrocytic fate decisions. 

Our work shows that reversible superstruc- 
tures can be formed in supramolecular materials 
when their large-scale dynamics are directed by 
the formation of strong noncovalent bonds that 
can be externally disrupted. Mechanistic insights 
for this phenomenon were obtained using a 
computational model that also identified the 
molecular parameters that enable the bonding- 
directed spatial redistribution of monomers to 
form and disassemble the superstructures. Our 
initial observations used DNA hybridization as 
the strong interaction in the experimental system, 
but we showed that the principles learned can 
be applied to other strongly interacting chemical 
structures such as charged peptides. The dynamic 
supramolecular systems enabled us to discover 
how changes in architectural features in fibrous 
hydrogel networks can modulate important phe- 
notypic transformations in astrocytes linked to 
brain and spinal cord injury as well as neurolog- 
ical diseases. 
Western blots in (B). All values were normalized to actin expression; three
experiments were analyzed. ***P < 0.001 (least significant difference 
test). Error bars denote SD. (F) Reactive oxygen species (ROS) 
quantification on the different substrates relative to cell number. 
(G) SEM micrographs of a reactive cell on bundled fibers and a naive cell on
individual fibers. Cells are false-colored in blue. The magnified view (lower
images) shows the cell-substrate interaction. Bundles are false-colored
in pink. Scale bars, 5 µm (upper images), 2 µm (lower images).
BIOHYBRID MICROBES

Light-driven fine chemical production in yeast biohybrids


Junling Guo, Miguel Suastegui, Kelsey K. Sakimoto, Vanessa M. Moody,
Gao Xiao, Daniel G. Nocera, Neel S. Joshi


Inorganic-biological hybrid systems have potential to be sustainable, efficient,
and versatile chemical synthesis platforms by integrating the light-harvesting
properties of semiconductors with the synthetic potential of biological cells. We
have developed a modular bioinorganic hybrid platform that consists of highly
efficient light-harvesting indium phosphide nanoparticles and genetically engineered
Saccharomyces cerevisiae, a workhorse microorganism in biomanufacturing.
The yeast harvests photogenerated electrons from the illuminated nanoparticles
and uses them for the cytosolic regeneration of redox cofactors. This process enables
the decoupling of biosynthesis and cofactor regeneration, facilitating a carbon-
and energy-efficient production of the metabolite shikimic acid, a common precursor
for several drugs and fine chemicals. Our work provides a platform for the
rational design of biohybrids for efficient biomanufacturing processes with higher
complexity and functionality.


Inorganic-biological hybrid systems combine
the light-harvesting efficiency of inorganic 
systems with established biosynthetic path- 
ways in live cells, thus promising a sustainable 
and efficient biochemical synthesis platform 
(1, 2). Comprehensive solar-to-chemical produc- 
tion has been investigated with bioinorganic hy- 
brid systems, including semiconductor-conjugated 
hydrogenases for biohydrogen production (3-5), 
long-wavelength-absorbing nanomaterials inte- 
grated into plants for enhanced photosynthetic 
efficiency (6), and photoelectrodes coupled with 
whole cells for hydrogenation reactions (7) as 
well as atmospheric CO₂ and N₂ fixation (8-11).
Microorganisms are used in biomanufacturing 
because of their rapid proliferation and ability to 
convert renewable carbon sources into higher-
value chemicals through genetically program-
mable multistep catalysis (12). In the context of
inorganic-biological hybrids, autotrophic bacteria 
have been investigated intensively, with a focus 
on simple organic molecules (7-4). Interfacing 
heterotrophic organisms with light-harvesting 
inorganics may provide advantages, such as in- 
creased efficiency in the production of high-value 
chemicals (15-17). Common heterotrophs (e.g.,
Saccharomyces cerevisiae) are already used widely 
in industrial settings because of the large catalog 
of target metabolites accessible through genetic 
manipulation tools (18). The well-studied biology
of canonical model organisms may provide access
to better genetic and analytical tools to unravel 
the mechanisms governing electron transport 
and metabolic flux in biohybrid systems (19). 

Of particular interest is the regeneration of the 
redox cofactor NADPH (reduced form of nicotin- 
amide adenine dinucleotide phosphate), owing to 
its central role as a cosubstrate in biosynthetic 
pathways (20). This process is strongly inter- 
twined with biomass production and is a com- 
mon bottleneck in the production of metabolites 
through microbial cell factories (21). The primary 
source of NADPH in yeasts is the pentose phos- 
phate pathway (PPP), which oxidizes a hexose 
sugar with concomitant loss of two equivalents 
of CO₂, decreasing theoretical carbon yields
(22). Therefore, decoupling NADPH genera- 
tion from central carbon metabolism may help 
maximize carbon flux for the production of desired 
metabolites (20). 

We developed an S. cerevisiae-indium phosphide
(InP) hybrid system, which combines rationally 
designed metabolic pathways and the electron 
donation capabilities of illuminated semiconduc- 
tors (Fig. 1 and fig. S1). InP was selected as a 
photosensitizer in this biohybrid system because 
its direct bandgap (E_g = 1.34 eV) enables efficient
absorption of a large fraction of the solar spec-
trum and is positioned appropriately to accept
electrons from various species in the culture me-
dium (fig. S2) (23). Additionally, its stability to
oxygen and biocompatibility suggest that InP is 
an ideal material for biological integration (24). 
InP nanoparticles were prepared independently 
(fig. S3) and subsequently assembled on geneti- 
cally engineered yeast cells by means of a bio- 
compatible, polyphenol-based assembly method 
(25) (Fig. 1A and fig. S1). Yeast strain S. cerevisiae 
Δzwf1 was selected for the engineering of the
biohybrid system. The deletion of the gene ZWF1,
encoding the glucose-6-phosphate dehydrogenase
enzyme, disrupts the oxidative portion of the PPP
(22), causing a marked decrease in cytosolic 
NADPH generation capacity (Fig. 1B). We exam- 
ined the integrated function of the biohybrid 
system to regenerate NADPH, which is essential 
for the biosynthesis of shikimic acid (SA), a pre- 
cursor of aromatic amino acids (Fig. 1, C and D). 
S. cerevisiae Δzwf1 was genetically engineered
to overexpress four genes to enhance carbon
flux through the SA pathway (Fig. 1B) (22). The
pentafunctional protein Aro1, which catalyzes
the reduction of 3-dehydroshikimic acid (DHS) 
to SA, is selective for NADPH, and a low availa- 
bility of cytosolic NADPH directly affects the 
production of SA, leading to elevated accumu- 
lation of its precursor, DHS. In previous examples 
of biohybrid systems, light-harvesting semi- 
conductor particles attached to the surface of 
bacteria were able to provide reducing equiv- 
alents to central metabolic processes (12). We 
reasoned that the S. cerevisiae Δzwf1-InP
hybrid system (fig. S4) could operate similarly,
with electrons flowing from the illuminated,
surface-bound InP particles to the regeneration
of NADPH from NADP⁺ (nicotinamide adenine
dinucleotide phosphate) inside the cell (Fig. 1, C
and D). This regenerated NADPH can fuel the
ultimate conversion of DHS to SA (26). Therefore,
the S. cerevisiae Δzwf1-InP hybrids both enable
us to evaluate the efficiency of NADPH regener- 
ation and lead to the enhanced biosynthesis of a 
highly sought-after molecule through photon 
energy conversion. 
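As a rough illustration of why a 1.34 eV bandgap covers much of the solar spectrum, the following sketch converts a bandgap energy into the corresponding absorption-edge wavelength through λ = hc/E. The function name `absorption_edge_nm` is hypothetical and introduced only for this example; the constants are the exact SI values, and only the 1.34 eV figure comes from the text.

```python
# Convert a semiconductor bandgap (eV) into its absorption-edge wavelength (nm).
# The helper name is illustrative; only the 1.34 eV bandgap is taken from the text.
PLANCK_J_S = 6.62607015e-34        # Planck constant, J s (exact SI value)
LIGHT_SPEED_M_S = 2.99792458e8     # speed of light, m/s (exact SI value)
ELECTRON_VOLT_J = 1.602176634e-19  # one electronvolt in joules (exact SI value)


def absorption_edge_nm(bandgap_ev: float) -> float:
    """Longest wavelength (nm) whose photons still carry the bandgap energy."""
    if bandgap_ev <= 0:
        raise ValueError("bandgap must be positive")
    wavelength_m = PLANCK_J_S * LIGHT_SPEED_M_S / (bandgap_ev * ELECTRON_VOLT_J)
    return wavelength_m * 1e9


if __name__ == "__main__":
    print(f"InP absorption edge: {absorption_edge_nm(1.34):.0f} nm")
```

Under these assumptions the edge falls in the near infrared, so photons across the visible range carry enough energy to excite the material.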

We designed a series of control experiments 
in which the presence of light and InP nano- 
particles was varied (fig. S5). After 72 hours of 
aerobic growth, S. cerevisiae Δzwf1-InP hybrids
under illumination (5.6 mW cm⁻²) (fig. S6)
achieved the highest DHS-to-SA conversion rate
with a SA/DHS ratio of 23.5 ± 1.6 (Fig. 2A and
figs. S7 to S9). In contrast, a control experiment
without illumination led to a ratio of only 0.67 ±
0.3 (fig. S10). Similarly, a low SA/DHS ratio was
observed in the presence of InP nanoparticles that 
were not assembled on the cell surface, suggesting 
the importance of proximity in enabling photo- 
chemical synthesis. This is in line with recent 
reports describing the ability of cell wall-bound 
components in yeasts to contribute to extracel- 
lular electron transport through electron “hop- 
ping” mechanisms (27). In the absence of InP, 
S. cerevisiae Δzwf1 also showed a lower SA/DHS
ratio, regardless of illumination scheme (figs. S11
and S12). The total SA production of the illumi-
nated biohybrid system was superior to all other
conditions, with a final titer of 48.5 ± 2.1 mg L⁻¹,
showing an 11-fold increase compared with its 
counterpart with no illumination and a 24-fold 
increase compared with engineered cells in the 
presence of unattached InP nanoparticles (Fig. 2B). 
DHS-to-SA conversion yield also increased, to a 
point, with higher light intensities but decreased 
under the highest light intensity, possibly as a 
result of metabolic saturation or photodamage to 
the cells (fig. S13). 
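The ratios and fold changes quoted above can be related by simple arithmetic. The sketch below assumes that the SA/DHS ratio r is a mass ratio, so that the mass fraction [SA]/([SA] + [DHS]) equals r/(1 + r), and that an "n-fold increase" means the illuminated titer divided by the comparison titer. Both helper names are hypothetical, and the back-calculated comparison titers are implied values, not numbers reported in the text.

```python
def mass_fraction_from_ratio(sa_to_dhs: float) -> float:
    """Return [SA]/([SA] + [DHS]) for a given ratio r = [SA]/[DHS], i.e. r/(1 + r)."""
    if sa_to_dhs < 0:
        raise ValueError("ratio must be non-negative")
    return sa_to_dhs / (1.0 + sa_to_dhs)


def implied_comparison_titer(titer_mg_per_l: float, fold_increase: float) -> float:
    """Back-calculate the comparison titer, assuming fold = titer / comparison."""
    if fold_increase <= 0:
        raise ValueError("fold increase must be positive")
    return titer_mg_per_l / fold_increase


if __name__ == "__main__":
    # Ratios and titer quoted in the text; everything derived from them is an estimate.
    for label, ratio in (("illuminated biohybrid", 23.5), ("dark control", 0.67)):
        print(f"{label}: SA mass fraction = {mass_fraction_from_ratio(ratio):.3f}")
    for label, fold in (("no illumination", 11.0), ("unattached InP", 24.0)):
        print(f"{label}: implied titer = {implied_comparison_titer(48.5, fold):.1f} mg/L")
```

Read this way, a ratio of 23.5 corresponds to SA making up roughly 96% of the combined SA and DHS pool, against about 40% for the dark control.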

The SA/DHS ratio has previously been shown 
to serve as a metabolic readout for cytosolic levels 
of NADPH/NADP⁺ (28). The highest NADPH/
NADP⁺ ratio calculated for the illuminated bio-
hybrid experiment reaches a value of 87.1 ± 6.0
(Fig. 2C). Notably, this value was higher than 
even that measured for InP-free and -integrated 
wild-type S. cerevisiae, which possesses the fully 
functional machinery to produce NADPH through 
the oxidative PPP (fig. S14). S. cerevisiae Δzwf1
in darkness showed the lowest NADPH/NADP⁺
ratios, regardless of the presence of InP. These 
results support the contention that irradiated, 
cell surface-assembled InP can drive the regen- 
eration of cofactor NADPH, facilitating the 
conversion of DHS to SA. As determined from 
colony-forming unit (CFU) assays on nutrient-rich 
solid media, cell viability did not differ before and 
after the assembly of InP nanoparticles (Fig. 2D), 
confirming the biocompatibility of the particle 
assembly protocol. During fermentation in selec- 
tive minimal media, cell count decreased for 
the biohybrids, regardless of the illumination 
scheme (see supplementary materials for addi- 
tional discussion). 

To further evaluate the metabolic performance 
of the biohybrid systems, we characterized their 
ability to consume glucose and variations in car- 
bon flux. Glucose was fully consumed by the bare 
cells during the first 24 hours, whereas nearly 
25% of the total initial glucose remained unused 
in the complete biohybrid scheme (Fig. 3A). 
The SA production kinetics in the S. cerevisiae 
Δzwf1-InP hybrids showed that the conversion
of DHS to SA occurred throughout the entire 
illumination period (Fig. 3B), suggesting a con- 
tinuous supply of NADPH and potential accu- 
mulation of biosynthetic intermediates along the 
SA pathway that lags behind glucose consump- 
tion (29). This finding was also supported by the 
consistently high mass fractions of SA (~90%). 
The specific SA yields of S. cerevisiae Δzwf1-InP
hybrids surpassed those of the S. cerevisiae Δzwf1
and the wild-type bare cells cultured in darkness
(Fig. 3C). The light-to-SA conversion efficiency
reached a maximum of 1.58 ± 0.05% 12 hours
after the start of fermentation and dropped as 
SA production plateaued (fig. S15). To unravel 
the variations in the central carbon metabolism 
caused by the illumination of InP in the bio- 
hybrids, we measured the production of secreted 
by-products, including ethanol and glycerol, 
linked to other pathways (figs. S16 and S17). 
Figure 3D shows that the concentration of these 
by-products produced by the illuminated bio-
hybrids (C_L) was lower than its counterpart
under dark conditions (C_D). This implies that the
surface-assembled, photoexcited InP shunts car-
bon in S. cerevisiae Δzwf1 toward the desired SA
pathway, with less activity in alternative path- 
ways for NADPH regeneration (e.g., pathways 
catalyzed by aldehyde dehydrogenase) (Fig. 3E). 

Though membrane-bound hydrogenases have 
been invoked to explain the ability of previously 
reported analogous bacterial systems to make use 
of photogenerated electrons (29-31), S. cerevisiae 
is surrounded by a cell wall composed of extra- 
cellular polymeric substances that would pre- 
vent direct contact between membrane-bound 
proteins and the InP particles. The need for
proximity between InP particles and the cells 
suggests that the cell wall might mediate elec- 
tron transfer in our biohybrids (27). Differential 
pulse voltammetry performed on the spent med- 
ium after fermentation (Fig. 3F and fig. S18) 
exhibited peaks more negative than the thermo-
dynamic potential for NADP⁺/NADPH [E° =
-0.324 V versus normal hydrogen electrode
(NHE) at pH 7]. This higher redox activity after
S. cerevisiae Δzwf1 growth suggests that soluble
redox-active species likely also play a role in 
electron transfer, as has been reported for yeast- 
based microbial fuel cells without exogenous 
mediators (32). The electron transfer mecha-
nism in these S. cerevisiae Δzwf1-InP hybrids
remains an active subject of investigation, as
multiple possible mechanisms (e.g., soluble medi-
ators and cell wall-bound redox mediators)
could exist in our biohybrid system. 

The development of inorganic-biological hy- 
brid systems in yeast will enable expansion of 
this overall approach to the production of higher- 
value metabolites. For example, the production 
of benzylisoquinoline alkaloids, which is already 
established in yeast, requires the activity of more 
than 10 membrane-bound cytochrome P450 oxido- 
reductases that depend on NADPH as an electron 
donor (18). The technology presented in this work 
may thus elevate the production efficiency of 
alkaloid natural products and other drugs and 
nutraceuticals, though practical implementation 
Fig. 1. Assembly of S. cerevisiae-InP hybrids and rationally designed metabolic pathways.
(A) InP nanoparticles were first functionalized with polyphenol moieties and then assembled on the
surface of genetically engineered yeast to form modular inorganic-biological hybrids. (B) Metabolic
engineering scheme for overproduction of SA. S. cerevisiae Δzwf1 has the oxidative PPP disrupted
(ZWF1), leading to low cytosolic NADPH pools, which directly affects the SA pathway and reduces
carbon loss in the form of CO₂. (C and D) Schematic of cellular NADPH regeneration and SA
biosynthesis assisted by photogenerated electrons from InP nanoparticles. G6P, glucose-6-phosphate;
F6P, fructose-6-phosphate; Ru5P, ribulose-5-phosphate; E4P, erythrose-4-phosphate; PEP, phospho-
enolpyruvate; DAHP, 3-deoxy-D-arabinoheptulosonate-7-phosphate; HEX, hexokinase; ZWF1, glucose-
6-phosphate 1-dehydrogenase; PGI1, phosphoglucose isomerase; RKI1, ribose-5-phosphate
ketol-isomerase; TKL1, transketolase; ARO4 K229L, feedback-insensitive DAHP synthase; ARO1 D920A,
mutant pentafunctional aromatic enzyme; TCA, tricarboxylic acid cycle; h, Planck’s constant; ν,
frequency; h⁺, electron hole; e⁻, electron; D, putative electron donors in the cell culture medium; D_ox,
oxidized electron donor species.
will require the development of illumination
sources that interface with scaled-up fermenters.
Our synthetic scheme is highly modular, enabling
a mix-and-match approach; makes use of cheap
components; and is compatible with existing
workhorse cellular chassis and a wide range of
particle-cell combinations. A more thorough under-
standing of electron transport mechanisms and
global changes to metabolic flux will undoubt-
edly facilitate design and implementation of
even better biohybrid systems. These systems
would make use of alternative energy sources to
streamline metabolic efficiency. With an ever-
growing set of genetic tools, functional nano-
particles, and cell types, modular biohybrid
platforms are likely to enable efficient and eco-
nomical biochemical production of valuable and
challenging targets.
Fig. 2. Physiological and metabolic characterization of the
S. cerevisiae-InP hybrid system. (A) Comparison of SA/DHS ratios
in biohybrids and in yeast-only fermentations with light and dark
conditions. (B) Total accumulation of SA and DHS after 72 hours
of growth. (C) Estimation of cytosolic-free NADPH/NADP⁺ ratio,
based on the conversion of DHS to SA. (D) Cell viability assay based
on counting of CFU, performed on rich solid media. The inset
shows that the preparation of the biohybrids does not affect the
initial CFU amount. Variation is represented by SE (error bars)
from three independent replicates for all data points.

Fig. 3. Carbon utilization, cytosolic-free NADPH, and electron trans-
fer in the S. cerevisiae-InP hybrid system. (A) Glucose consumption
over the course of 72-hour culture. (B) SA production profiles in light and
dark conditions. SA/DHS conversion yield was expressed as a mass
fraction: [SA]/([SA] + [DHS]). (C) Specific SA yield based on consumed
glucose and cell dry weight (CDW). (D) Percent variation in SA and
by-product formation of the biohybrids under light (C_L) versus
dark (C_D) conditions over time. (E) Proposed metabolic flux distributions
based on total SA plus DHS concentrations and by-product (glycerol and
ethanol) formation. CB, conduction band; VB, valence band. (F) Differential
pulse voltammetry of culture medium before and after S. cerevisiae
Δzwf1 growth. Arrows indicate electrochemical signatures from possible
species with sufficient reducing potential to convert NADP⁺ to NADPH.
NaPi, sodium phosphate; FTO, fluorine-doped tin oxide. Variation is
represented by SE (error bars) from three independent replicates for all
data points.
Wafer-scale single-crystal hexagonal
boron nitride film via self-collimated
grain formation


Joo Song Lee, Soo Ho Choi, Seok Joon Yun, Yong In Kim, Stephen Boandoh,
Ji-Hoon Park, Bong Gyu Shin, Hayoung Ko, Seung Hee Lee,
Young-Min Kim, Young Hee Lee, Ki Kang Kim, Soo Min Kim


Although polycrystalline hexagonal boron nitride (PC-hBN) has been realized, defects and 
grain boundaries still cause charge scatterings and trap sites, impeding high-performance 
electronics. Here, we report a method of synthesizing wafer-scale single-crystalline hBN 
(SC-hBN) monolayer films by chemical vapor deposition. The limited solubility of boron (B) 
and nitrogen (N) atoms in liquid gold promotes high diffusion of adatoms on the surface 
of liquid at high temperature to provoke the circular hBN grains. These further evolve into 
closely packed unimodal grains by means of self-collimation of B and N edges inherited 
by electrostatic interaction between grains, eventually forming an SC-hBN film on a wafer 
scale. This SC-hBN film also allows for the synthesis of wafer-scale graphene/hBN 
heterostructure and single-crystalline tungsten disulfide. 


Hexagonal boron nitride (hBN), called white
graphite, consists of atomically flat layers 
of alternating hexagonal B and N atoms 
held together by van der Waals interac- 
tion between layers. The insulating hBN 
plays a role in a variety of fundamental science 
and technology fields, serving, for example, as 
a platform for charge fluctuation, contact resist- 
ance, gate dielectric, passivation layer, Coulomb 
drag, and atomic tunneling layer (1-6). Although
micrometer-sized hBN grains have been com- 
monly employed for fundamental studies, wafer- 
scale single-crystalline hBN (SC-hBN) films 
are not yet available for practical applications. 
One approach to reach SC-hBN film is to start 
with grains of a triangular shape at random ori- 
entations and eventually merge them to form 
the polycrystalline hBN (PC-hBN) film. How- 
ever, grain boundaries between randomly ori- 
ented hBN grains inevitably yield PC-hBN film. 
An alternative to achieve SC-hBN film is there- 
fore desired. 
The concept for the synthesis of SC-hBN film 
is schematically shown in Fig. 1, A to C. Au is 
liquefied at high temperatures (~1100°C) and
robustly anchored on W foil still holding in 
solid state owing to the high melting temper- 
ature (~3422°C). The key idea is to retain a 
flat liquid Au with high surface tension to al- 
low for strong adhesion to borazine precur- 
sors (Fig. 1A, i). The solubility of B and N 
atoms in liquid Au (at 1100°C) is ~0.5 and ~0
atomic %, respectively (7, 8), ensuring preva- 
lent surface diffusion of B and N atoms rather 
than bulk diffusion (Fig. 1A, ii). At the initial 
growth stage (~30 s), the diameter sizes of cir- 
cular hBN grains are irregularly distributed from 
approximately a few micrometers to ~14.0 µm
(Fig. 1C, i, and supplementary materials and
methods). The circular hBN grains increase
to a regular size of 14.5 µm after 10 min of
growth (Fig. 1C, ii). These circular hBN grains 
are also observed on liquid Cu substrate at high 
temperatures, rather than triangular hBN grains 
on solid substrates at low temperatures (9-12). 
The detailed edge structures and the corre- 
sponding edge energy of circular hBN grains 
should be investigated further. High diffu- 
sion of adatoms on a smooth liquid surface at 
high temperatures provokes the circular hBN 
domains. The size and uniformity of hBN grains 
are strongly influenced by the content of bor-
azine and H₂ (fig. S1).

After a prolonged growth of 20 min (Fig. 1C, 
iii), the density of hBN grains further increases 
without a noticeable progressive change in size. 
Moreover, the well-regulated sizes of hBN grains 
are linearly aligned in some regions, indicated 
by white arrows. When two hBN grains merge, 
the individual grains are rotated by less than 
60° with respect to each other by means of at- 
tractive Coulomb interaction between B (Lewis 
acid) and N (Lewis base) atoms, leading to a 
seamless stitching by the self-collimated hBN 
grains (Fig. 1A, iii and iv, and fig. S2), which is 
a result of the cohesive energy of a B-N bond
being much higher than that of an N-N or B-B 
bond at a high growth temperature (1100°C) 
(fig. S3). In addition, the orientation of the 
hBN grains is not commensurate with the lat- 
tice orientation of the underlying Au substrate, 
which is confirmed by electron backscatter 
diffraction measurements (fig. S4). The hBN 
grains are transformed further into a hex- 
agonal close-packed structure at 30-min growth 
time through the self-collimation of hBN grains 
(Fig. 1C, iv). The seamless stitching of aligned 
hBN grains with the absence of grain bound- 
ary was further confirmed by the statistical 
analysis of a series of selected-area electron dif- 
fraction (SAED) patterns in transmission elec- 
tron microscopy (TEM) by means of hBN transfer 
on a graphene-supported (or MoS₂-supported)
TEM grid (figs. S5 to S8). Even at a longer growth 
time (~90 min), hBN grains were not fully merged 
with the presence of nanopores observed at a 
fixed precursor flow rate (fig. S9). We were 
able to achieve the full coverage of wafer-scale 
hBN film through two-step growth of elevated 
growth time and additional precursor flow rate 
(Fig. 1A, vi, and Fig. 1C, v to vi, and supplemen- 
tary materials and methods). A wafer-scale full 
SC-hBN film is obtained with a size of 3 cm by 
3 cm, followed by transfer to an SiO₂/Si wafer
(Fig. 1B). 
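One way to read the need for a second growth step is geometric: equal circles in a hexagonal close-packed arrangement cannot tile a plane completely. The sketch below is an idealized estimate, assuming rigid, equal circular grains of the 14.5 µm diameter quoted above and ignoring edge effects; the function names are hypothetical, and this interpretation is offered as an illustration, not as the authors' analysis.

```python
import math


def hex_packing_coverage() -> float:
    """Area fraction covered by equal circles in hexagonal close packing."""
    return math.pi / (2.0 * math.sqrt(3.0))


def close_packed_grain_count(diameter_um: float, side_um: float) -> float:
    """Approximate number of close-packed circular grains in a square region.

    Each grain occupies one rhombic unit cell of area (sqrt(3)/2) * d**2.
    Edge effects are ignored, so this is an order-of-magnitude estimate.
    """
    if diameter_um <= 0 or side_um <= 0:
        raise ValueError("dimensions must be positive")
    cell_area_um2 = (math.sqrt(3.0) / 2.0) * diameter_um ** 2
    return side_um ** 2 / cell_area_um2


if __name__ == "__main__":
    coverage = hex_packing_coverage()
    print(f"ideal close-packed coverage: {coverage:.1%}")
    print(f"uncovered fraction: {1.0 - coverage:.1%}")
    # 3 cm by 3 cm film expressed in micrometers, 14.5 um grains (both from the text).
    print(f"grains in a 3 cm square: {close_packed_grain_count(14.5, 3.0e4):.2e}")
```

In this idealization about 9% of the surface remains uncovered between touching grains, which is consistent in spirit with the nanopores reported at a fixed precursor flow rate.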

The grain size distribution and coverage 
as a function of growth time are displayed in 
Fig. 1D. The grain size rapidly increases to 
saturate at ~14.5 µm within 5-min growth time,
whereas the standard deviation of the grain
size abruptly decreases. The hBN grains are rap-
idly grown at a rate of ~283 µm²/s owing to the
high-temperature process (1100°C), which is 
~10° times higher than the typical growth rate 
of hBN on solid substrate at lower temper- 
ature (~1000°C) (10). Meanwhile, the hBN cov- 
erage gradually increases and saturates to a full 
coverage at 60-min growth time. We empha- 
size that monolayer SC-hBN is achieved on a 
wafer scale with no appreciable multilayer hBN 
islands, confirmed by scanning electron micro- 
scope (SEM) images (fig. S10). This implies that 
monolayer hBN film is grown exclusively by 
means of surface-mediated growth on catalytic 
metal substrate under the current growth con- 
ditions. The stoichiometry of B and N atoms is 
1:1.03, confirmed by x-ray photoelectron spec- 
troscopy (XPS) (fig. S11). Furthermore, the ex- 
pensive Au foil can be reused for the repeated 
growth of SC-hBN film (fig. S12).

We characterized the single crystallinity of 
hBN film on a large scale by three different 
methods: electron diffraction in TEM, liquid 
crystal (LC)-assisted polarized optical micro- 
scope (POM), and low-energy electron diffrac- 
tion (LEED). The monolayer nature of hBN film 
is identified at the edge of hBN grains prepared 
on a TEM grid (Fig. 2, A and B), again con- 
firming the results of atomic force microscopy 
analysis (fig. S13). Furthermore, the high-resolution 
TEM displays clear hexagonal B and N atoms 
(Fig. 2C). Fast Fourier transform spots from the 
whole image (inset of Fig. 2C) demonstrate only
one set of hexagonal spots, assuring the hexago-
nal structure of the sample. The d-spacings of the
(101̄0) and (112̄0) planes are 2.17 and 1.26 Å, re-
spectively, confirming SC-hBN, in good agreement
with reported values (13). The aberration-corrected
dark field in the scanning transmission electron
microscopy image and intensity profile along
the white-dashed line clearly distinguish the
B and N atoms in the hexagonal lattice, with
higher intensity of N atoms than that of B atoms
(Fig. 2, D and E) (14, 15). The bond length be-
tween B and N atoms is 1.45 Å (16). A series of
SAED patterns vertically stacked by nine frames
from regions I to IX in Fig. 2A (fig. S14) exhibit
identical six hexagonal dots (Fig. 2F), ensur-
ing that the hBN film is single crystalline in a
selected area of ~300 µm by 300 µm. By spin-
coating nematic LC on hBN film, the POM pat-
terns in the absence of grains are not altered,
regardless of polarized angles, again demonstrat- 
ing the single crystallinity of hBN on a large 
scale (Fig. 2G and fig. S15). This is contrasted 
with inhomogeneous grain speckles with polar- 
ized angles due to the presence of multi-hBN 
grains in the PC-hBN film (fig. S16) (17). LEED
images from 16 different regions in a 4 mm by 
Fig. 1. Synthesis of single-crystal hBN film. (A) Schematic illustration for
the growth of SC-hBN film by means of self-collimated circular hBN grains with a
rotation invoked by the attractive Coulomb interaction of B and N edges be-
tween grains (i to vi). BZ, borazine. (B) Photograph of a wafer-scale SC-hBN
4 mm area with a separation of 1 mm display
identical distorted hexagonal spots with the 
same rotation angle (Fig. 2H), indicating that 
the hBN film is indeed a single crystal over the 
whole area. An elongated uniaxial strain of 
~14% is observed, similar to that noted in pre- 
vious reports (18-20). The hBN lattice is relaxed 
after hBN transfer on the TEM grid and, con- 
sequently, the regular hexagonal lattice is pre- 
served by restoring the strain without provoking 
fracture. 
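The d-spacings and bond length quoted above are mutually consistent with a single in-plane lattice constant. The sketch below applies the standard hexagonal-lattice relation for planes with l = 0, 1/d² = (4/3)(h² + hk + k²)/a². The lattice constant a = 2.504 Å is an assumed literature value for hBN, not a number given in the text, and the function names are hypothetical.

```python
import math

# Assumed in-plane lattice constant of hBN in angstroms (literature value, not from the text).
A_HBN_ANGSTROM = 2.504


def d_spacing_hk0(h: int, k: int, a: float = A_HBN_ANGSTROM) -> float:
    """Interplanar spacing for (h k i 0) planes of a hexagonal lattice, with i = -(h + k)."""
    index_sum = h * h + h * k + k * k
    if index_sum == 0:
        raise ValueError("(h, k) must not both be zero")
    return a / math.sqrt((4.0 / 3.0) * index_sum)


def nearest_neighbor_bond(a: float = A_HBN_ANGSTROM) -> float:
    """B-N bond length in a honeycomb lattice with lattice constant a."""
    return a / math.sqrt(3.0)


if __name__ == "__main__":
    print(f"d(10-10) = {d_spacing_hk0(1, 0):.3f} A")
    print(f"d(11-20) = {d_spacing_hk0(1, 1):.3f} A")
    print(f"B-N bond = {nearest_neighbor_bond():.3f} A")
```

With this assumed lattice constant the computed values land within about 0.01 Å of the 2.17 Å, 1.26 Å, and 1.45 Å figures reported from the micrographs.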

Our SC-hBN film can serve as a platform 
growth substrate for construction of a vertical, 
two-dimensional (2D) heterostructure or single-
crystal 2D material on a wafer scale (21). We
now demonstrate the synthesis of a single-crystal
vertical graphene/hBN heterostructure (SC-Gr/
hBN) and single-crystal WS₂ (SC-WS₂) film on
a wafer scale. Epitaxial graphene is successively 
grown in situ on SC-hBN film at 1100°C under 
methane atmosphere (Fig. 3A and supplemen- 
tary materials and methods). The orientation- 
ally aligned hexagonal graphene domains on 
SC-hBN film are clearly visible (Fig. 3B), even- 
tually achieving the monolayer graphene film in 
a large area at a prolonged growth time while 
preserving SC-hBN film (Fig. 3C). SC-Gr/hBN 


16 November 2018 


film is transferred onto the SiO₂/Si substrate
and onto the TEM grid for further character-
izations. The G-band (~1585 cm⁻¹) and 2D-band
(~2686 cm⁻¹) peaks in the Raman spectrum are
clearly detected without a noticeable D-band
(~1330 cm⁻¹) (Fig. 3D). Moreover, the E_2g phonon
mode of monolayer hBN film is observed near
1370 cm⁻¹ (inset of Fig. 3D). The SC-Gr/hBN film
stacked at eight different regions in SAED pat- 
terns in TEM (fig. S17) shows an identical set of 
six hexagonal dots (Fig. 3E). This proves that the 
well-defined structure between graphene and 
SC-hBN is constructed by means of AA’-stacking 
with two distinct dots assigned to the (101̄0)
planes of graphene and hBN (inset of Fig. 3E)
and the relative rotation angle between two dots
of 0.64 ± 0.34° obtained from eight different
regions (fig. S17). Furthermore, the moiré pattern 
of Gr/hBN heterostructure is clearly observed 
with a wavelength of ~9.20 nm rotated by 1.2° 
(Fig. 3, F and G, and fig. S18), ensuring the suc- 
cessful synthesis of SC-Gr/hBN film through van 
der Waals epitaxy (22). The distinct B-K, C-K, and 
N-K edges in the electron energy loss spectros- 
copy with preserved stoichiometry of B and N of 
1.03:1 further support the growth of SC-Gr/hBN 
film (fig. S18C).
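The moiré wavelength of a graphene/hBN stack can be estimated from the lattice mismatch δ and the twist angle φ with the commonly used relation λ = (1 + δ)a / sqrt(2(1 + δ)(1 − cos φ) + δ²). The sketch below is a hedged illustration: the graphene lattice constant (0.246 nm) and the mismatch (1.8%) are assumed literature values, treating the quoted 1.2° as a twist angle is our assumption, and the function name is hypothetical.

```python
import math

A_GRAPHENE_NM = 0.246    # assumed graphene lattice constant in nm (literature value)
LATTICE_MISMATCH = 0.018  # assumed graphene/hBN lattice mismatch (literature value)


def moire_wavelength_nm(
    twist_deg: float,
    a_nm: float = A_GRAPHENE_NM,
    delta: float = LATTICE_MISMATCH,
) -> float:
    """Moire superlattice period (nm) for a given twist angle in degrees."""
    if delta <= 0:
        raise ValueError("this sketch assumes a positive lattice mismatch")
    phi = math.radians(twist_deg)
    denominator = math.sqrt(2.0 * (1.0 + delta) * (1.0 - math.cos(phi)) + delta ** 2)
    return (1.0 + delta) * a_nm / denominator


if __name__ == "__main__":
    for angle in (0.0, 0.64, 1.2):
        print(f"twist {angle:>4.2f} deg -> period {moire_wavelength_nm(angle):.2f} nm")
```

Under these assumptions a twist near 1.2° gives a period of roughly 9 nm, the same scale as the ~9.20 nm wavelength reported here; the exact value is sensitive to the mismatch chosen.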
film on a SiO₂/Si wafer. (C) Growth evolution of SEM images of hBN film.
Single-headed arrows indicate linear alignment of hBN grains. (D) Time evolution
of hBN grain size and coverage. Full coverage of monolayer hBN film is achieved
at 60-min growth time. Error bars indicate the size deviation of hBN grains.


Fig. 3. Direct growth of vertical SC-Gr/hBN heterostructure and 
single-crystal WS2 film. (A) Schematic for the direct growth of
commensurate epitaxial graphene on SC-hBN film. (B) SEM image of
as-grown hexagonal graphene domains on SC-hBN film and (C) the
corresponding large-area SC-Gr/hBN film. The dashed lines in (B)
indicate the aligned orientation of individual hexagons. (D) Represent-
ative Raman spectrum of graphene/hBN heterostructure after transfer
onto SiO2/Si substrate. The inset shows the Raman spectrum near
1370 cm⁻¹, assigned to the E2g phonon mode of hBN. (E) Combined
SAED pattern stacked vertically by eight SAED patterns for 
graphene/hBN heterostructure. The inset shows a zoomed-in SAED 
Fig. 2. Atomic structures of
SC-hBN film. (A) TEM image of
SC-hBN film transferred onto
TEM grid divided into nine
regions. (B) The folded edge of a
monolayer (1L) SC-hBN film.
(C) High-resolution TEM image
of SC-hBN film. The inset shows
the fast Fourier transform of
the whole image. The d-spacings
of (10-10) and (11-20) planes of
hBN are 0.217 and 0.126 nm,
respectively. (D) Aberration-
corrected dark-field TEM image
of SC-hBN film. Cyan and blue
balls indicate B and N atoms,
respectively. (E) Intensity profile
along white-dashed line in (D).
a.u., arbitrary units. (F) Vertically
stacked SAED pattern image
of nine segments from regions I
to IX in (A). (G) Schematic
illustration of aligned LC (5CB)
on SC-hBN and optical images of
5CB-coated SC-hBN film as a
function of the polarized light
angles: 0, 30, 60, 90, and 180°.
(H) LEED pattern images of
SC-hBN film on Au substrate
over an area of 4 mm by 4 mm.
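
The two hBN d-spacings quoted in the caption (0.217 and 0.126 nm) can be cross-checked against the in-plane spacing formula for a hexagonal lattice, 1/d² = (4/3)(h² + hk + k²)/a² for l = 0. The sketch below assumes an hBN in-plane lattice constant of a = 0.2504 nm, a commonly cited literature value that is not stated in this report; the Miller-Bravais planes (10-10) and (11-20) correspond to (h, k) = (1, 0) and (1, 1).

```python
import math

# Assumed literature value for the hBN in-plane lattice constant (nm).
A_HBN_NM = 0.2504


def hex_d_spacing_nm(h, k, a_nm):
    """In-plane (l = 0) interplanar spacing of a hexagonal lattice, in nm."""
    return a_nm / math.sqrt((4.0 / 3.0) * (h * h + h * k + k * k))


d_1010 = hex_d_spacing_nm(1, 0, A_HBN_NM)  # (10-10) planes
d_1120 = hex_d_spacing_nm(1, 1, A_HBN_NM)  # (11-20) planes

if __name__ == "__main__":
    print(f"d(10-10) ~ {d_1010:.3f} nm, d(11-20) ~ {d_1120:.3f} nm")
```

Under this assumption the computed spacings agree with the caption values to within about 0.001 nm.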
pattern of one of the hexagonal spots, revealing an angle deviation of
~1° between two spots. (F) Representative moiré pattern of graphene on
hBN, with a rotation angle of ~1.2° between the components, and
(G) the corresponding simulated moiré pattern. The wavelength of the
moiré pattern is ~9.20 nm. The three insets depict the diffraction
pattern and schematics of graphene and hBN lattices, each rotated
21.8° and 23.0° from the horizon. (H) SEM image of triangular
WS2 domains grown on SC-hBN film. The right top and bottom insets
show the SEM image of as-grown single-crystal monolayer WS2 film
with a scale bar of 100 μm and the corresponding LEED pattern image
with (11-20) and (10-10) planes of monolayer WS2.
Meanwhile, the ex situ growth of SC-WS2 film on SC-hBN film is
performed. To synthesize SC-WS2, SC-hBN film is coated with a W
precursor, and the growth of WS2 film is further carried out at 900°C
under an Ar, H2, and ammonium sulfide atmosphere (supplementary
materials and methods). The aligned triangular WS2 domains are clearly
observed (Fig. 3H), and a large-scale monolayer WS2 film forms at a
prolonged growth time (right-top inset of Fig. 3H). This is markedly
distinct from a previous report in which on-top and inverse triangular
WS2 domains are mixed on bare Au, leading to polycrystalline WS2 film
(figs. S19A and S20) (23). Therefore, SC-hBN film plays a crucial role
in attaining a homogeneous orientation of the triangular WS2 domains.
The synthesis of such homogeneous triangular WS2 domains is further
confirmed by Raman spectroscopy, photoluminescence, and scanning
tunneling electron microscopy (fig. S21). All triangular WS2 domains
have similar aligned orientations within ±1.33° (fig. S19, C and D),
and a single set of six hexagonal dots of SC-WS2 film (LEED) is clearly
observed (Fig. 3H), revealing that the large-area monolayer WS2 film is
indeed a single crystal. The growth of MoS2 domains with similar
orientations is also achieved (fig. S22), further supporting the
universal use of SC-hBN film as a growth substrate for transition metal
dichalcogenides. In contrast with the SC-Gr/hBN heterostructure film,
hBN is not detected after growth of WS2 film, indicating that the hBN
film might have been substituted or etched away during the growth of
WS2 film. Further study is required to understand the growth mechanism
for SC-WS2 film on SC-hBN film.

Fig. 4. Protecting layer against Cu oxidation and water vapor barrier
applications of wafer-scale SC-hBN film. SEM images of SC-hBN-covered,
PC-hBN-covered, and bare Cu foils before (A, F, and K) and after (B, G, and
L) oxidation in air at 300°C for 1 hour. Optical (C, H, and M) and corre-
sponding XPS (D, I, and N) mapping images of SC-hBN, PC-hBN, and bare
Cu samples after oxidation. (E, J, and O) Representative Cu 2p core level
spectra from the circle, triangle, and square symbols from (D), (I), and (N). The
peaks near 952.2 and 932.3 eV in the spectra are assigned to Cu 2p1/2 and
Cu 2p3/2, respectively. (P and Q) Schematic and photograph for the WVTR
measurement. (R) WVTR values of PET, PC-hBN, and SC-hBN samples.

We further demonstrate that the wafer-scale SC-hBN film can serve as a
protecting layer against metal oxidation and a gas-diffusion barrier
for water vapor transmission (24, 25). Cu foil, which is easily
oxidized in air, is chosen as a test metal. For the oxidation test, hBN
film is transferred onto Cu foil (supplementary materials and methods).
For comparison, PC-hBN film synthesized on solid Au substrate is used
(fig. S9). The Cu surface covered by SC-hBN film is not noticeably
changed after the oxidation test at 300°C in air, whereas both
PC-hBN-covered and bare Cu surfaces are severely oxidized (Fig. 4, A, F,
K and B, G, L), as indicated by the change of color from orange to dark
orange in each photograph. Some regions of the PC-hBN-covered Cu surface
withstand oxidation because of the presence of hBN grains (Fig. 4G), but
O2 or H2O gases easily permeate through structural defects such as
grain boundaries or point defects, leading to Cu oxidation. For
quantitative analysis of Cu oxidation, the samples are characterized by
optical microscopy and corresponding XPS mapping of the Cu2+ satellite
peak near 943 eV in Cu 2p core-level spectra, related to CuO (Fig. 4,
C, H, M and D, I, N) (26). The XPS mapping image for the SC-hBN sample
is quite uniform with low intensity of the CuO peak (Fig. 4D), whereas
PC-hBN and bare Cu samples show prominent CuO peaks (Fig. 4, I and N).
The representative CuO peak in Cu 2p core-level spectra is negligible
for the SC-hBN sample, whereas it is well developed for both PC-hBN and
bare Cu samples (Fig. 4, E, J, and O). For the gas-diffusion barrier
application, water vapor transmission rate (WVTR) measurement is carried
out for hBN film transferred onto polyethylene terephthalate (PET) film
(Fig. 4, P to R, and supplementary materials and methods). The WVTR
values of PC-hBN and SC-hBN monolayer films are 1.01 and
0.60 g/m²·day, respectively, which are 30% and 58% less than that of
PET (1.44 g/m²·day). The SC-hBN monolayer film outperforms the PC-hBN
monolayer film by approximately a factor of 2. The obtained WVTR value
is comparable to that of monolayer polycrystalline graphene film (27),
but it is expected to improve further after optimization of the
transfer technique. The wafer-scale SC-hBN film does not have any grain
boundary, resulting in a water vapor barrier and the complete
protection against Cu oxidation.
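
The percentages quoted for the WVTR comparison follow directly from the three measured values. The short sketch below reproduces that arithmetic; the variable and function names are illustrative, and reading the "factor of 2" as the ratio of the two percentage reductions is an interpretation made here, not a statement from the report.

```python
# WVTR values from the text, in g per m^2 per day.
WVTR_PET = 1.44      # bare PET film
WVTR_PC_HBN = 1.01   # PET covered with polycrystalline hBN monolayer
WVTR_SC_HBN = 0.60   # PET covered with single-crystal hBN monolayer


def percent_reduction(reference, value):
    """Percentage by which `value` is lower than `reference`."""
    return 100.0 * (reference - value) / reference


reduction_pc = percent_reduction(WVTR_PET, WVTR_PC_HBN)
reduction_sc = percent_reduction(WVTR_PET, WVTR_SC_HBN)

# Assumed reading of "a factor of 2": ratio of the two reductions.
reduction_ratio = reduction_sc / reduction_pc
# Alternative reading: direct ratio of the two transmission rates.
rate_ratio = WVTR_PC_HBN / WVTR_SC_HBN

if __name__ == "__main__":
    print(f"PC-hBN reduction: {reduction_pc:.1f}%")
    print(f"SC-hBN reduction: {reduction_sc:.1f}%")
    print(f"ratio of reductions: {reduction_ratio:.2f}")
    print(f"ratio of WVTR values: {rate_ratio:.2f}")
```

The ratio of reductions comes out just under 2, whereas the direct ratio of the two transmission rates is closer to 1.7, so the size of the improvement depends on which comparison is intended.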

In summary, we have synthesized SC-hBN film 
by means of self-collimation between self-regulated 
circular hBN grains without a grain boundary. The
key step is the facile rotation of circular hBN grains 
on the liquid Au substrate, regulated by attractive 
electrostatic interaction between B and N atoms at 
the perimeter of each grain that eventually leads to 
the single-crystal growth of hBN film on a wafer 
scale. The SC-hBN film serves as a promising sub- 
strate for the single-crystal growth of the graphene/ 
hBN heterostructure and WS2 film on a wafer
scale. Our strategy for the synthesis of SC-hBN 
film opens a new horizon for the single-crystal 
growth of other 2D materials and their hetero- 
structures on a wafer scale. 
ULTRAFAST DYNAMICS


Beyond the molecular movie: 
Dynamics of bands and bonds during 
a photoinduced phase transition 


C. W. Nicholson¹, A. Lücke², W. G. Schmidt², M. Puppin¹, L. Rettig¹,
R. Ernstorfer¹, M. Wolf¹*


Ultrafast nonequilibrium dynamics offer a route to study the microscopic interactions that
govern macroscopic behavior. In particular, photoinduced phase transitions (PIPTs) in
solids provide a test case for how forces, and the resulting atomic motion along a reaction
coordinate, originate from a nonequilibrium population of excited electronic states.
Using femtosecond photoemission, we obtain access to the transient electronic structure
during an ultrafast PIPT in a model system: indium nanowires on a silicon(111) surface. We
uncover a detailed reaction pathway, allowing a direct comparison with the dynamics predicted
by ab initio simulations. This further reveals the crucial role played by localized photoholes
in shaping the potential energy landscape and enables a combined momentum- and
real-space description of PIPTs, including the ultrafast formation of chemical bonds.


Reactive events in nature are associated with
the formation or breaking of chemical
bonds. Within the Born-Oppenheimer approximation
(1), a description of reactions
that separates the atomic and electronic
degrees of freedom is used, such that the atomic 
system evolves across a potential energy surface 
defined by the transient electronic structure. To 
test the validity of this nonequilibrium approach, 
whether in finite or extended systems, requires 
knowledge of both atomic and electronic struc- 
ture on ultrafast time scales. The ultrafast dynam- 
ics of insulator-to-metal phase transitions offer 
an especially promising route because the change 
in electronic structure during these events is par- 
ticularly extreme, and typically accompanied by 
a structural distortion. Ultrafast techniques have 
opened up avenues for exploring the interplay 
between the atomic and electronic subsystems 
(2-5), including during photoinduced insulator- 
to-metal transitions (6-9); these techniques ad- 
ditionally enabled the making of reciprocal space 
movies charting electronic structure dynamics 
(7, 10) and “molecular movies” (4, 11), which fol-
low the real-time position of atoms during struc-
tural changes. Uniting these concepts to examine 
not only atomic positions, but also the underly- 
ing electronic structure determining the reaction 
pathway along the potential energy surface (PES), 
has been a long-pursued goal (12). Time- and angle- 
resolved photoemission spectroscopy (trARPES) 
is ideally suited for accessing the nonequilibrium 
electronic structure, as it allows direct access to 
the electronic band structure on ultrafast time 
scales and its occupation in momentum space
(k). Furthermore, this picture of electronic bands
in periodic systems, often favored by physicists,
is Fourier-equivalent to a real-space (r) descrip-
tion of chemical bonds (13, 14), which suggests
the possibility of following ultrafast bond dy-
namics in r-space (15) based on measurements in
k-space (16). We realize this by determining the
reaction pathway—including the full electronic 
structure dynamics—during an ultrafast struc- 
tural phase transition at a surface, thereby going 
beyond the molecular movie concept. 
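
The Fourier equivalence between bands in momentum space and bonds in real space can be illustrated with a textbook one-dimensional tight-binding chain. This toy model is not the In/Si(111) system, and the on-site energy and hopping values are made-up parameters chosen only for illustration. The sketch samples the band E(k) across one Brillouin zone and inverse-Fourier-transforms it, recovering the real-space matrix elements: the on-site energy at separation 0 and the nearest-neighbour hopping (the "bond") at separation ±1.

```python
import cmath
import math


def band_energy(k, onsite=0.0, hopping=1.0, a=1.0):
    """Nearest-neighbour tight-binding band E(k) = onsite - 2*hopping*cos(k*a)."""
    return onsite - 2.0 * hopping * math.cos(k * a)


def real_space_amplitudes(n_k=64, max_shell=3, onsite=0.0, hopping=1.0, a=1.0):
    """Discrete inverse Fourier transform of E(k) over one Brillouin zone.

    Returns a dict {R: H(R)}, where H(R) is the real-space matrix element
    between sites that are R lattice vectors apart.
    """
    ks = [2.0 * math.pi * n / (n_k * a) for n in range(n_k)]
    amplitudes = {}
    for shell in range(-max_shell, max_shell + 1):
        total = sum(
            band_energy(k, onsite, hopping, a) * cmath.exp(-1j * k * shell * a)
            for k in ks
        )
        amplitudes[shell] = (total / n_k).real
    return amplitudes


if __name__ == "__main__":
    # Hypothetical parameters: on-site energy 0.5 eV, hopping 1.2 eV.
    for shell, value in real_space_amplitudes(onsite=0.5, hopping=1.2).items():
        print(f"R = {shell:+d}: H(R) = {value:+.3f} eV")
```

In this toy case a change in the measured bandwidth maps one-to-one onto a change in the hopping amplitude, which is the sense in which k-space measurements can carry real-space bond information.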

Our model phase transition system consists of 
atomic indium nanowires on the (111) surface of 
silicon, denoted In/Si(111). The system undergoes 
an order-order structural transition accompa- 
nied by an electronic insulator-to-metal transition 
(17, 18). A close interplay between the electronic
structure and specific lattice motions during the 
phase transition has been predicted that, in ad- 
dition to a detailed knowledge of the equilibrium 
structure (19-21), makes this system ideal for
investigating ultrafast changes in both k- and 
r-space. Recent time-resolved electron diffraction 
measurements have revealed that the structural 
photoinduced phase transition (PIPT) is com- 
pleted within 1 ps (22), but such a technique does 
not give direct access to the underlying transient 
electronic dynamics. 

Here we use trARPES to follow the ultrafast 
evolution of the electronic band structure during 
the PIPT in In/Si(111), which, combined with ab 
initio molecular dynamics (AIMD) simulations, 
allows access to the microscopic forces and 
mechanisms driving the structural transition 
and the dynamics of chemical bonds. To mea- 
sure the dynamics of the electronic structure of 
In/Si(111), we have developed a 500-kHz repeti- 
tion rate extreme ultraviolet (XUV) source at 
22 eV (23), representing a substantial advance 
compared with the state of the art (24, 25). This 
allows efficient access to the full, or even multiple, 
Brillouin zones (BZs) in many materials. A schematic
trARPES experiment is shown in Fig. 1A:
The pump pulse (hν = 1.55 eV) excites electrons
above the Fermi level (E_F); the electrons are then
ejected from the sample after a variable delay
time Δt by the probe pulse (hν = 22 eV). A cross-
correlation of 40 fs between pump and probe
pulses is obtained. In contrast to traditional ARPES
(26), this allows simultaneous access to the elec-
tronic structure above and below E_F (Fig. 1B).
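
Why a 22 eV probe opens up whole Brillouin zones can be seen from the free-electron photoemission relation k∥ = 0.5123 Å⁻¹ · sqrt(E_kin / eV) · sin θ, where E_kin is the photoelectron kinetic energy and θ the emission angle. The sketch below is a back-of-the-envelope estimate only: the 4.5 eV work function and the 6 eV comparison source are assumptions introduced here for illustration, not values from the report, and θ = 90° is an upper bound that no real analyzer reaches.

```python
import math

# sqrt(2 * m_e) / hbar, expressed in 1 / (angstrom * sqrt(eV)).
K_PREFACTOR = 0.5123


def k_parallel_max(photon_energy_ev, work_function_ev=4.5,
                   binding_energy_ev=0.0, theta_deg=90.0):
    """Largest in-plane momentum (1/angstrom) reachable by photoemission.

    work_function_ev is an assumed sample work function; binding_energy_ev
    is measured below the Fermi level; theta_deg is the emission angle.
    """
    e_kin = photon_energy_ev - work_function_ev - binding_energy_ev
    if e_kin <= 0.0:
        return 0.0  # photon energy too low to emit electrons
    return K_PREFACTOR * math.sqrt(e_kin) * math.sin(math.radians(theta_deg))


if __name__ == "__main__":
    for hv in (6.0, 22.0):  # 6 eV is a hypothetical low-energy source for comparison
        print(f"h_nu = {hv:4.1f} eV -> k_parallel,max ~ {k_parallel_max(hv):.2f} 1/angstrom")
```

Under these assumptions the 22 eV probe reaches roughly 2 Å⁻¹ at the Fermi level, against about 0.6 Å⁻¹ for the low-energy comparison source.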

In/Si(111) undergoes a transition from an
insulating (8 x 2) to a metallic (4 x 1) structure
above 130 K (27, 28) (Fig. 1, C and D). The bonding
motif in the insulating phase (Fig. 1C) consists
of distorted hexagons, whereas in the conducting
phase, the In atoms rearrange into zig-zagging
chains (Fig. 1D). The k-space band structures
of the two phases calculated within the GW approximation
are given in Fig. 1, E and F. In contrast
to the (4 x 1) phase, which has three metallic
bands (m1 to m3) that cross E_F (17) (Fig. 1F), the
(8 x 2) phase is gapped at the Γ8×2 and X8×2
points (Fig. 1E). Upon increasing the temperature
across the (8 x 2) to (4 x 1) phase transition, the
states initially lying far above E_F at Γ8×2 shift down
in energy and eventually cross E_F, forming the
metallic m1 band of the (4 x 1) phase. Concurrently,
the energy gap in the m2 and m3 bands at
the X8×2 point closes, and the bands shift apart
in momentum along the k_x direction (23). We
note that the three metallic bands predicted from 
the calculation in the (4 x 1) phase are clearly ob- 
served in Fig. 1B. The Fermi surface of the (4 x 1) 
phase in Fig. 1G shows the momentum cut along 
which our data are obtained. 

To investigate the PIPT, we cooled the sample 
to 25 K and photoexcited it by a pump pulse with 
incident fluence F = 1.35 mJ cm⁻², which corresponds
to an excitation density in the surface
In layer of around one electron per unit cell, implying
a homogeneous excitation far from a dilute
limit. Selected snapshots following excitation are
shown in Fig. 2, A to D. At Δt = −450 fs (Fig. 2A),
the XUV pulse arrives before the pump pulse;
hence, the band structure reflects the unperturbed
(8 x 2) phase with only states below E_F
occupied. Shortly after excitation, at Δt = 50 fs
(Fig. 2B), previously unoccupied states above E_F
become clearly visible. An evolution of electronic
states occurs, most clearly observed for the states
around Γ8×2 (k_x = 0.75 Å⁻¹), which shift down in
energy between Δt = 50 and Δt = 250 fs (Fig. 2C).
At Δt = 900 fs (Fig. 2D), the system has fully
transformed into the (4 x 1) phase. The overlaid 
GW band structure for the two phases highlights 
the occurrence of the PIPT. 

The dynamics of selected spectral features 
chart the progress of the PIPT (Fig. 2E). The 
arrows in Fig. 2, A to D, mark the positions and 
the direction along which one-dimensional slices 
of the data are analyzed and fitted to obtain the 
band positions presented in Fig. 2E as a function 
of time delay [see also (23)]. The fastest dynamics 
are found at X8×2 (red arrow in Fig. 2A), where
the band gap closes within 200 fs, thus defining
the ultrafast insulator-to-metal transition. As a
second step, the conduction band edge at the BZ
center (orange arrow) is found to reach E_F
after 500 fs. Finally, the structural transition, as
measured by the splitting between bands m2 and m3
(Fig. 2D, blue arrows), is completed after ~700 fs.
This third time scale is in excellent agreement
with the structural transition time scale ob-
served by time-resolved electron diffraction,
which is completed after ~700 fs with a time
constant τ = 350 fs (22). It is notable that even
before the structural transition is completed, two 
physically meaningful electronic transitions have 
occurred. 
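
To relate the 350 fs time constant to the ~700 fs completion time, one can assume that the structural signal follows a single exponential approach, 1 − exp(−Δt/τ). This functional form is an assumption made here for illustration and is not a fit taken from the report; the function name is likewise hypothetical.

```python
import math


def completed_fraction(delay_fs, tau_fs=350.0):
    """Fraction of an exponential transition completed after delay_fs.

    Assumes a single-exponential approach with time constant tau_fs.
    """
    return 1.0 - math.exp(-delay_fs / tau_fs)


if __name__ == "__main__":
    # Delays discussed in the text: gap closing, band-edge crossing, structural completion.
    for delay in (200, 500, 700, 900):
        print(f"delay = {delay:3d} fs -> {100.0 * completed_fraction(delay):.1f}% complete")
```

Under this assumption a delay of two time constants (700 fs) corresponds to the transition being about 86% complete, which shows that "completed" here should be read as "largely completed" rather than as a sharp end point.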


Fig. 1. Experiment overview and material sys- 
tem. (A) Schematic trARPES experiment, where 
At is the variable delay between pump (red) 
and probe (purple) pulses. (B) Excited-state 
photoemission data (log-color scale) obtained at 
T = 150 K in the metallic (4 x 1) phase with an 
excitation fluence F = 2 mJ cm⁻². (C) Schematic
r-space structure in the (8 x 2) phase and (D) in
the (4 x 1) phase. Solid black lines highlight
the structural motifs of the two phases; blue
lines represent bonds. (E) Electronic band 
structure (k-space) calculated within the GW 
approximation in the (8 x 2) phase and in (F) for 
the (4 x 1) phase, corresponding to the 
structures in (C) and (D). The experimental 
characterization of the two phases is shown in 
(23). (G) Fermi surface obtained at 150 K with 
the 22-eV laser revealing the cut along which 
time-resolved measurements were obtained 
(white line). Solid orange lines mark the (4 x 1) 
BZ boundaries, whereas dashed lines mark the 
boundaries of the (8 x 2) BZ. High-symmetry 
points in the two phases are marked with crosses. 


Fig. 2. Electronic and atomic structure
during photoinduced phase transition. (A to
D) trARPES data (F = 1.35 mJ cm⁻²) on a
logarithmic color scale at selected delays at a
base temperature of T = 25 K. Arrows highlight
the positions of the features of interest, which are
followed in (E). (E) Dynamics of the features
marked by arrows in (A) and (D). Red data points
track the size of the band gap at the zone
boundary over time, whereas the orange data
mark the position of the band edge at the zone
center with respect to the Fermi level. The blue
data reveal the change of splitting between the
two innermost bands marked in (D). Solid curves
are the dynamics of the relevant spectral features
from AIMD simulations, rescaled with respect
to the GW band structure. For further details,
see (23). (F) Evolution of the atomic structure
(AIMD trajectories) through the PIPT, showing the
mean squared displacement of the atomic
positions from the (4 x 1) phase following excitation:
$\sum_i |\mathbf{R}_i - \mathbf{R}_{i,4\times1}|^2$. Trajectories for two initial
excitation conditions are shown, including (blue)
and not including (purple) the observed localized
hole population; only the former drives the PIPT.
During the PIPT, the relevant atomic modes evolve
with an average speed of 0.1 pm fs⁻¹ (23).
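
The displacement measure plotted in panel (F), the squared distance of each atom from its position in the (4 x 1) reference structure summed over atoms, can be sketched as follows. The coordinates in the usage example are invented placeholders and do not represent the In/Si(111) geometry, and the function name and the optional averaging flag are illustrative choices.

```python
def squared_displacement(positions, reference, mean=False):
    """Sum over atoms of |R_i - R_i,ref|^2 for matching lists of (x, y, z) tuples.

    If mean is True, the sum is divided by the number of atoms.
    """
    if len(positions) != len(reference):
        raise ValueError("positions and reference must list the same atoms")
    total = 0.0
    for atom, atom_ref in zip(positions, reference):
        total += sum((c - c_ref) ** 2 for c, c_ref in zip(atom, atom_ref))
    return total / len(positions) if mean else total


if __name__ == "__main__":
    # Hypothetical two-atom snapshot and reference, in arbitrary length units.
    reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    snapshot = [(0.1, 0.0, 0.0), (1.0, 0.2, 0.0)]
    print(squared_displacement(snapshot, reference))
    print(squared_displacement(snapshot, reference, mean=True))
```

Evaluating this quantity at every time step of a trajectory yields a curve that falls toward zero when the structure reaches the reference phase and stays finite when it does not.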
The distinct time scales of these three spectral
features reveal a detailed pathway of the phase 
transition as it evolves along the electronic PES. 
To gain microscopic insight into the evolution of 
the atomic structure, electronic properties, and 
bond strengths along this pathway, we have 
performed AIMD simulations based on density 
functional theory (DFT) within the local density 
approximation (LDA), constrained by the exper- 
imental results. Because the experiment reveals 
the transient changes to the electronic states and 
their occupation, these can be used to simulate 
realistic excitation scenarios with AIMD. We have
mapped the experimental k-space distribution 
of excited carriers across multiple BZs in Fig. 3. 
This reveals that electrons are strongly delocalized 
throughout the BZ, in contrast to photoholes, 
which are localized at the BZ boundary. Such a 
distribution is substantially different from that 
of the excitation conditions assumed in a pre- 
vious study, which forced excited electrons to be 
confined to the BZ center (22). In a first attempt, 
we assume transiently hot electronic distribu- 
tions in the AIMD simulations based on the 

Fig. 3. k-space distribution of excited 
carriers. Experimentally measured difference 
map of the photoemission signal throughout 
multiple BZs in the (8 x 2) phase, revealing 
the distribution of excited electrons (red)
and holes (blue) following photoexcitation
(F = 0.7 mJ cm⁻²). The distribution is obtained
from the difference between spectra before
excitation (Δt = −1000 fs) and Δt = 0 fs.


experimentally determined time-dependent elec- 
tronic temperature (fig. S6). However, the cor- 
responding calculated trajectory (purple curve in 
Fig. 2F) describes an incomplete phase tran- 
sition: The system starts to evolve from the (8 x 2) 
phase toward the (4 x 1) phase, but finally returns 
to the (8 x 2) ground state. This indicates that the 
LDA-DFT electronic structure is not sufficiently 
accurate. Indeed, the inclusion of electronic self- 
energy effects within the GW approximation 
raises the energy of the uppermost zone bound- 
ary valence state by about 0.2 eV with respect to 
the zone center states (fig. S7). Self-energy effects 
beyond the LDA thus lead to the preferential con- 
finement of photoholes at the BZ boundary as 
experimentally observed (Fig. 3). Unfortunately, 
AIMD simulations based on a self-energy cor- 
rected electronic structure are computationally 
prohibitively expensive. Therefore, we compensate
for the misalignment of the valence state energies
on an ad hoc basis by fixing the occupation num- 
bers (on top of the thermal occupation) in the 
AIMD simulations such that holes occur at the 
BZ boundary and the zone center valence states 
are occupied (23). The AIMD simulation based 
on this excitation scenario now indeed results in 
a complete phase transition (Fig. 2F, blue curve). 
This underlines the role of zone-boundary photo- 
holes as a key driving force in the structural tran- 
sition. Moreover, the corrected AIMD simulation 
reproduces all three time scales observed in the 
k-space experiment (Fig. 2E, solid lines), reveal- 
ing a high level of accuracy in the simulated PES 
and the corresponding trajectory, even on these 
ultrafast time scales. The excellent agreement 
between our data and the simulations is strong 
evidence for the coherent directed motion of 
atoms within all unit cells during the PIPT, in 
accord with the previous electron diffraction study 
(22). Such ultrafast directed dynamics cannot 

Fig. 4. Dynamics of bands and bonds during the insulator-to-metal transition. (A to C) Position
of the k-space bands close to the Γ8×2 point at selected time delays extracted from the trARPES
data, overlaid on the calculated LDA band structure (color-filled for clarity). Error bars mark a 95%
confidence level. (D to F) Corresponding r-space dynamics of the orbital, obtained from the Fourier
transform of the k-space band structures associated with the Γ8×2 band in (A) to (C). Both the shape
of the orbital distribution and the bond strength, indicated by the color scale, change during the
phase transition, as a bond across the indium hexagon is formed. A complementary picture of
charge transfer during the bond formation and breaking, as well as movies of the full k- and r-space
dynamics (movies S1 and S2), can be found in (23).


be explained by a statistical picture of the phase 
transition where different regions of the sample 
evolve incoherently (23). 

To further exemplify the high level of agree- 
ment between experiment and theory, in Fig. 4, 
A to C, we compare the calculated band struc- 
ture at three snapshots during the PIPT with the 
corresponding band position at Γ8×2 extracted
from our data. Both the calculated energetic posi- 
tion and the slope of the dispersion are observed 
to change in agreement with the experimental 
data. This agreement enables us to extract the 
r-space dynamics of nuclei and chemical bonds
during the PIPT from the simulation. To do so,
we plot the electronic orbitals associated with
the bands discussed above at the BZ center (Γ8×2)
in Fig. 4, D to F, again for three snapshots. A 
transition from an orbital localized between op- 
posite In hexagon atoms to a delocalized metallic 
state along the In chains is clearly seen during 
the PIPT. 

To describe chemical bond formation addi- 
tionally requires a measure of the bond strength. 
A quantitative understanding of bond strengths 
in extended systems can be gained from the 
crystal orbital Hamilton population (COHP)
(29, 30), which resolves each band into bonding
and antibonding contributions as a function of 
energy—essentially a bonding character density 
of states for each electronic band. By performing 
a COHP analysis along the AIMD trajectory, we 
obtain the evolution of the surface bond strengths 
during the phase transition (23). In Fig. 4, D to F, 
we show the formation of an In-In bond across 
the neighboring chains. A gradual evolution of the 
bond strength up to 2 eV is observed, encoded 
in the blue-to-red color scale applied to the or- 
bitals in Fig. 4, D to F. Combined with the orbital 
distribution, this reveals the ultrafast formation 
of an In-In bond during the transition into the 
(4 x 1) structure, on the same time scale as the 
closing of the electronic gap in this region, i.e., 
within 500 fs. The buildup of bond strength thus 
parallels the transition from a localized molecu- 
lar orbital (insulator) to a delocalized (metallic) 
state during the phase transition. 

From our analysis, the following complete 
microscopic mechanism for the PIPT emerges: 
Upon excitation, holes are created in the bonding
states at X8×2, which correspond to In-In
dimer bonds between the outer In chain atoms 
(23). Consequently, the dimer bonds character- 
istic for the hexagon structure weaken and break. 
At the same time, a sizable fraction of excited 
electrons populates the states at Γ8×2 that are
formed by a bonding combination of In states 
from neighboring In chains. Population of these 
excited states leads to interatomic forces that 
transform the hexagons into zig-zag chains, re- 
sulting in bond formation (Fig. 4, D to F). The 
electron band related to these bonds (m1) is lowered
in energy as the In atoms contributing to this 
bond approach each other, further populating 
those states and strengthening the bond. It finally 
crosses the Fermi energy as shown in Fig. 4C, re-
sulting in the metallic state of the (4 x 1) phase.

Our combined experimental and theoretical 
approach extends the molecular movie concept 
by revealing the ultrafast electronic structure dy- 
namics that govern a nonequilibrium structural 
transition. This unifying description bridges two 
fundamental concepts of physics and chemistry— 
band structure and chemical bonds—during ultra- 
fast reactions. Besides elucidating the effect of 
the nonequilibrium electronic structure on struc- 
tural dynamics, understanding the potential ener- 
gy landscape induced by excitation paves the 
way for reaction pathways engineered via tai- 
lored excitation, potentially allowing optical con- 
trol over such dynamic processes. 
NETWORK SCIENCE


Quantifying reputation
and success in art


Samuel P. Fraiberger, Roberta Sinatra, Magnus Resch,
Christoph Riedl*, Albert-László Barabási*


In areas of human activity where performance is difficult to quantify in an objective 
fashion, reputation and networks of influence play a key role in determining access to 
resources and rewards. To understand the role of these factors, we reconstructed the 
exhibition history of half a million artists, mapping out the coexhibition network that 
captures the movement of art between institutions. Centrality within this network captured 
institutional prestige, allowing us to explore the career trajectory of individual artists
in terms of access to coveted institutions. Early access to prestigious central institutions
offered life-long access to high-prestige venues and reduced dropout rate. By contrast, 
starting at the network periphery resulted in a high dropout rate, limiting access to central 
institutions. A Markov model predicts the career trajectory of individual artists and 
documents the strong path and history dependence of valuation in art. 


The Man with the Golden Helmet, a 17th-century
painting attributed to Rembrandt,
was Berlin’s most famous artwork for decades.
Once evidence emerged, in the 1980s,
that the painting was not by Rembrandt, it
lost much of its artistic and economic value, even
though the artwork itself had not changed (1).
Quality in art is elusive; art appeals to individual 
senses, pleasures, feelings, and emotions. Recog- 
nition depends on variables external to the work 
itself, like its attribution, the artist’s body of work, 
the display venue, and the work’s relationship to 
art history as a whole (2, 3). Recognition and value 
are shaped by a network of experts, curators, col- 
lectors, and art historians whose judgments act as 
gatekeepers for museums, galleries, and auction 
houses (4). Given the fragmented and often secre- 
tive nature of transaction records, quantitative 
analyses of the art world have been difficult (5, 6). 
Although artists’ reputation is known to affect 
auction outcomes, our current understanding 
of these processes is based on small samples 
spanning short periods and limited to a coun- 
try or region (7-9). 

¹Network Science Institute, Northeastern University, Boston,
MA, USA. ²Harvard Institute for Quantitative Social Sciences,
Cambridge, MA, USA. ³Department of Mathematics and its
Applications and Center for Network Science, Central
European University, Budapest, Hungary. ⁴Complexity
Science Hub, Vienna, Austria. ⁵ISI Foundation, Turin, Italy.
⁶University of St. Gallen, St. Gallen, Switzerland. ⁷Zagreb
School of Economics and Management, Zagreb, Croatia.
⁸Division of Network Medicine, Department of Medicine,
Harvard Medical School, Boston, MA, USA. ⁹Department of
Network and Data Science, Central European University,
Budapest, Hungary.

*Corresponding author. Email: alb@neu.edu (A.-L.B.); c.riedl@neu.edu (C.R.)

Fraiberger et al., Science 362, 825-829 (2018)

Our dataset was collected by Magnus (www.magnus.net)
and combines information on artists’
exhibitions, auction sales, and primary market
quotes. It offers information on 497,796 exhibitions
in 16,002 galleries, 289,677 exhibitions in
7568 museums, and 127,208 auctions in 1239 auction
houses, spanning 143 countries and 36 years
(1980 to 2016, fig. S1), allowing us to reconstruct
the artistic career of 496,354 artists (see supple-
mentary text S1 for additional description and
validation and fig. S1a for an example) (10, 11).
The number of exhibitions for an artist followed 
a fat-tailed distribution; whereas 52% of the 
artists had one recorded show, a few high-profile 
artists were exhibited at an exceptional number 
of venues (fig. S1, c and d). Although half of 
the auctioned artworks sold for less than $4000, 
the price for art was as high as $110,500,000 
(fig. S1f). 

Prestigious institutions have access to well- 
regarded artists, and influential artists in turn 
tend to seek out prestigious institutions. Yet, 
institutional prestige is also highly subjective, 
determined by factors like history, leadership, re- 
sources, and geographic location. Given that 
major institutions act as art portfolios, we can 
uncover the slowly changing institutional pres- 
tige from frequent artwork exchanges, an ap- 
proach called “adiabatic approximation” (12). For 
this, we define an order τ coexhibition network,
whose nodes are museums and galleries, connected
by weighted directed links (i, j) that represent
the number of artists that exhibited first
in i then in j within a window of τ exhibits (fig.
S2, a and b) (13). The obtained order τ = ∞
coexhibition network, connecting 16,002 galleries 
and 7568 museums as nodes via 19,031,332 links, 
incorporates all art movement in our dataset. A 
subset of this network revealed the clustering 
inherent in the art world (Fig. 1 and figs. S3 and 
S4). The network core was a dense community of
major European and North American institutions,
underlining their access to a common pool
of artistic talents. Movement between the hubs in
the core was exceptionally high: The link weight
between the Museum of Modern Art (MoMA) and
the Guggenheim was 33 times higher than expected
if artists moved randomly between institutions
(supplementary text S2.1), reflecting a
highly concentrated movement of selected artists
between a few prominent institutions. Multiple 
dense regional communities of institutions in 
Europe, Asia, South America, and Australia were 
relatively isolated from the core, indicating that 
members of these communities share artists 
mainly among themselves. 
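
The construction of the coexhibition network described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function name `coexhibition_links`, the input format (a dictionary mapping each artist to a chronologically ordered list of institution identifiers), the choice to count each ordered pair at most once per artist, and the exclusion of self-links are all hypothetical choices made for the example.

```python
from collections import Counter


def coexhibition_links(careers, window=None):
    """Count directed links (i, j): artists who exhibited at i and later at j.

    careers: dict mapping artist -> chronologically ordered list of institutions.
    window:  how many subsequent exhibits to look ahead; None means no limit
             (the unlimited-window network).
    Assumption: each (i, j) pair is counted once per artist, self-links dropped.
    """
    links = Counter()
    for venues in careers.values():
        seen = set()
        for pos, i in enumerate(venues):
            stop = len(venues) if window is None else min(len(venues), pos + 1 + window)
            for j in venues[pos + 1:stop]:
                if i != j:
                    seen.add((i, j))
        links.update(seen)
    return links


# Toy example with hypothetical institutions.
careers = {
    "artist_a": ["G1", "M1", "M2"],
    "artist_b": ["G1", "M2"],
}
order_1 = coexhibition_links(careers, window=1)
order_inf = coexhibition_links(careers, window=None)
```

With a window of one exhibit only consecutive venues are linked, whereas the unlimited window also links `G1` to `M2` for the first artist, so that link carries weight 2.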

A network-based ranking using each institu- 
tion’s eigenvector centrality (14) was strongly 
correlated with known prestige measures (sup- 
plementary text S2.4 and fig. S5): (i) N = 9392 
institutions were independently assigned grades 
from A to D by a team of experts at Magnus based 
on criteria including longevity, the artists exhib- 
ited, size and quality of exhibition space, and art 
fair participation. A-rated institutions had high
network-based rankings, whereas those rated D
were in the bottom half (Fig. 2A). (ii) For each
institution, we computed the maximum relative 
price taken across all the artworks exhibited, 
observing a high correlation between network- 
based ranks and economic value of the exhibited 
artists’ artworks (Fig. 2B). The top 10-ranked
institutions had the highest cumulative sales 
values (Fig. 2C and fig. S6), indicating that the 
coexhibition network, though its construction is 
agnostic to price, identified institutions that have 
access to highly valued artists. In general, an in- 
stitution’s geographic distance to one of the 10 
largest hubs showed no relationship with pres- 
tige (fig. S7, a and b). By contrast, the network- 
based distance of an institution to one of the top 
10 institutions was closely linked to its prestige 
(fig. S7, c and d). Thus, network effects play a 
defining role in influencing the evolution of an 
artist’s reputation and valuation. 
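
To make the network-based ranking more concrete, the following Python sketch computes eigenvector centrality by power iteration on a weighted directed link list. It is only an illustration: the function name `eigenvector_centrality`, the convention that a node inherits score from the nodes pointing to it, the normalization to unit sum, and the toy graph are assumptions, and the source does not state which implementation was used.

```python
def eigenvector_centrality(weights, n_iter=200, tol=1e-10):
    """Power iteration on a weighted directed graph.

    weights: dict mapping (i, j) -> link weight from node i to node j.
    Assumption: a node's score is proportional to the weighted sum of the
    scores of the nodes that point to it; scores are normalized to sum to 1.
    """
    nodes = sorted({n for link in weights for n in link})
    score = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(n_iter):
        new = {n: 0.0 for n in nodes}
        for (i, j), w in weights.items():
            new[j] += w * score[i]
        norm = sum(new.values())
        if norm == 0:
            return score
        new = {n: v / norm for n, v in new.items()}
        if max(abs(new[n] - score[n]) for n in nodes) < tol:
            return new
        score = new
    return score


# Toy symmetric network: a triangle A-B-C plus a peripheral node D tied to A.
links = {}
for a, b in [("A", "B"), ("B", "C"), ("A", "C"), ("A", "D")]:
    links[(a, b)] = 1.0
    links[(b, a)] = 1.0
scores = eigenvector_centrality(links)
ranking = sorted(scores, key=scores.get, reverse=True)
```

In this toy network the best-connected node `A` ranks first and the peripheral node `D` last, mirroring how proximity to well-connected hubs raises a node's score.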

To show that artistic careers can be interpreted 
within the context of the institutions to which 
they have access, we grouped artists by the av- 
erage prestige of their first five exhibits. We 
assigned an artist a high initial reputation if 
her work was on average exhibited in the top 
20% of institutions as defined by network rank- 
ing; an artist had low initial reputation if his 
work was shown on average in the bottom 40% 
(supplementary text S3.1). A decade after their
fifth exhibit, 39% of the high-initial reputation 
artists continued to exhibit (Fig. 2D). For low- 
initial reputation artists, only 14% remained 
active 10 years later. Next, we selected 31,794 
artists, born between 1950 and 1990 with at 
least 10 exhibitions (Fig. 2E). As a group, high- 
initial reputation artists had continuous access 
to high-prestige institutions during their entire 
career (Fig. 3A). Of the 4058 high-initial reputation
artists, 58.6% remained in high-prestige territory
until the end of their recorded career, and
only 0.2% had the average prestige of their five 
most recent exhibits in the bottom 40% (Fig. 2F). 
This lock-in effect was largely absent for low- 
initial reputation artists: Their reception improved 
with time, advancing slowly to institutions of 
increasing prestige (Fig. 3A). Only 10.2% of low- 
initial reputation artists had the average pres- 
tige of their five most recent exhibits in the top 
20% (Fig. 2F). Overall, initial reputation (first five 
exhibits) predicted success across a variety of 




Fig. 1. Coexhibition network. Force-directed layout of the order τ = ∞
coexhibition network, whose nodes are institutions (galleries, museums).
Node size is proportional to each institution’s eigenvector centrality. Nodes
are connected if they both exhibited the same artist, with link weights
being equal to the number of artists’ coexhibitions. Node colors encode the
region in which institutions are located. Links are of the same colors as
their end nodes, or gray when end nodes have different colors. For
visualization purposes, we only show the 12,238 nodes corresponding to
institutions with more than 10 exhibits; we pruned the links by keeping the
most statistically significant links (20) (supplementary text S2.2). We
implemented community detection on the pruned network (21), identifying
122 communities (supplementary text S2.3). We highlighted five of them,
the full community breakdown being shown in fig. S3. We also show the
names of the most prestigious institution for each community.
measures: High-initial reputation artists had
twice as many exhibitions as low-initial reputation
artists (Fig. 2G); 49% of the exhibitions of
high-initial reputation artists occurred outside
of their home country, compared to 37% for low-initial
reputation artists (Fig. 2G), and high-initial
reputation artists showed more stability
in institutional prestige (Fig. 2H). The work of a
high-initial reputation artist was traded 4.7
times more often at auctions than that of a
low-initial reputation artist (Fig. 2I), at a
maximum price that was 5.2 times higher
(Fig. 2I). We also collected 442,314 prices of
artworks displayed in galleries, finding that the
average maximum price of high-initial reputation
artists was $193,064, compared to $40,476
for low-initial reputation artists (Fig. 2H). Thus,
art careers were characterized by strong path
dependence; artists starting in high-prestige
institutions located at the center of the network
showed a lower dropout rate and tended to
maintain their status. By contrast, those starting
at the periphery of the network showed a high
dropout rate, but if they persisted, their access to
top institutions gradually improved.

To model how reputation emerges in the art
world, let \(p[i_{t+1} \mid i_t]\) be the probability that an
artist, currently exhibited at institution \(i_t\), next
exhibits at institution \(i_{t+1}\). We assume that the
only institutions \(i_{t+1}\) reachable for the artist are
those that have exhibited an artist from institution
\(i_t\) before. We can therefore model an artistic
career as a random walk on the order τ = 1 network
(15, 16), the probability of moving to \(i_{t+1}\) being
proportional to the number of previous artists
who transitioned from \(i_t\) to \(i_{t+1}\) (fig. S2). We
assume that the network captures the connections
between curators and institutions, guiding
access to specific institutions. Independently of
where artists started their career, this model
directs them toward institutions of median prestige
(Fig. 3B), failing to capture the lock-in effect
observed in real careers. This suggests that access
to institutions also depends on the artist’s previous
exhibition history, not only on current
exhibition venue. To consider an artist’s previous
exhibition history \(i_1, i_2, \ldots, i_t\) (17), we write the
probability of the \(i_t \to i_{t+1}\) transition as

\[ p[i_{t+1} \mid i_t, \ldots, i_1] = K \times \mu[\pi_{i_{t+1}}; m_t] \times p[i_{t+1} \mid i_t] \qquad (1) \]

where K is a normalization factor and the second
term on the right-hand side captures the
memory of the system about artists’ reputations,
written as

\[ \mu[\pi_{i_{t+1}}; m_t] = \frac{p[\pi_{i_{t+1}} \mid m_t]}{p[\pi_{i_{t+1}}]} \qquad (2) \]

where
Fig. 2. Quantifying artistic careers. (A) Network-based prestige ranks,
captured by eigenvector centrality, for institutions that were independently
assigned different grades. (B) The relationship between sales-based
ranks and eigenvector centrality-based network ranks, binned in
100 intervals, showing a high Spearman’s correlation (\(\rho_s = 0.88\)). We
report mean (black line) and standard error (gray shading) within each bin.
(C) Data on top 10 institutions as predicted by the network-based ranking.
Colors capture geographical location, as shown in Fig. 1. (D) Survival
curves, showing the fraction of artists that continue to exhibit in the years
following their first five exhibits based on the career of 99,265 artists
with more than five exhibits. (E) Probability density function of average
prestige during the first five exhibits for the 31,794 artists with more than
10 exhibits born between 1950 and 1990. (F) Diagram illustrating how the
career of high- and low-initial reputation artists evolves, showing the
fraction of those artists whose final reputation (last five recorded exhibits)
is either low or high. To show how the early career determines various
success measures across a career, we consider as control variable the
average prestige of the first five exhibits of an artist, and report (G) the
total number of exhibits (left), the percentage of these exhibits outside of
their home country (right), (H) the standard deviation of their exhibition
prestige (left), the maximum price at which they are currently quoted in a
gallery (in $, right), (I) the total number of their works that were sold in
the auction market (left), and the maximum price (relative to the average
market price) at which their work sold in the auction market (right).
Each panel demonstrates the important role that initial reputation plays
in shaping later access to institutions and financial reward.
Fig. 3. Modeling the emergence of reputation. (A) For a random sample
including 30% of the 31,794 artists with more than 10 exhibits born between
1950 and 1990, we show the evolving exhibition prestige over time.
(B) Evolving exhibition prestige predicted by the random walk model
(memoryless), documenting its failure to capture real careers. (C) The memory
model predicts the evolution of prestige. We use the first five exhibits to
initialize the models. The sequence of dates at which an artist’s exhibitions
occur was matched to the one we observe in the data. (D to F) Variation of
the memory component with the prestige of the next exhibit π, for different
ranges of values for past reputation m. π and m are reported in decile.
(G) Probability density function of average prestige during the first five
exhibits for the 31,794 artists, and the subset of those artists who were born
in the United States, Canada, and India. (H) Final reputation versus initial
reputation for artists of different country of origin.

\[ m_t = \frac{1}{\tau_m} \sum_{k=t-\tau_m+1}^{t} \pi_{i_k} \qquad (3) \]

is the average reputation, representing the average
prestige of the artist’s past \(\tau_m\) exhibitions. In
other words, memory acts as a multiplicative
weight that depends on the average past reputation
of the artist and the prestige of the target
institution. This allows us to measure the memory
term \(\mu[\pi_{i_{t+1}}; m_t]\) directly from the data, helping
us document a strong reputation effect for all
artists (supplementary text S3.2 and Fig. 3, D
to F). Consider an artist whose previous exhibitions
conferred an average reputation in the
bottom decile, e.g., m = 0.1 (Fig. 3D). His chances
of exhibiting next at an institution whose prestige
π is in the bottom decile were 3.4 times
higher than expected by chance, and his probability
of moving to a top-decile institution
was only one-fifth of that expected by chance.
The monotonically decreasing \(\mu[\pi_{i_{t+1}}; m_t]\) with
prestige π indicates that an artist with low previous
reputation had a 17 times higher chance of
moving next to a low-prestige institution than
to a high-prestige one. We observe the opposite
trend for an artist whose previous reputation
was in the top decile, e.g., m = 0.9 (Fig. 3F): Her
relative chances of exhibiting once again at a
high-prestige institution were 42 times higher than
moving to a low-prestige institution.
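
One way to read "times higher than expected by chance" is as a ratio between the conditional probability of the next exhibit's prestige given past reputation and the unconditional probability of that prestige. The Python sketch below estimates such a ratio in deciles from observed transitions. It is an illustrative assumption rather than the authors' estimator: the function name `memory_term`, the binning scheme, and the toy data are hypothetical.

```python
from collections import Counter


def memory_term(transitions, n_bins=10):
    """Estimate mu = p[pi_bin | m_bin] / p[pi_bin] from observed transitions.

    transitions: list of (m, pi_next) pairs, both in [0, 1], where m is the
    artist's average past reputation and pi_next the prestige of the next venue.
    Assumption: both quantities are binned into n_bins equal-width bins.
    """
    def bin_of(x):
        return min(int(x * n_bins), n_bins - 1)

    pairs = Counter((bin_of(m), bin_of(p)) for m, p in transitions)
    m_total, p_total = Counter(), Counter()
    for (m_bin, p_bin), count in pairs.items():
        m_total[m_bin] += count
        p_total[p_bin] += count
    n = sum(pairs.values())
    return {
        (m_bin, p_bin): (count / m_total[m_bin]) / (p_total[p_bin] / n)
        for (m_bin, p_bin), count in pairs.items()
    }


# Toy data: artists mostly stay at their current prestige level.
transitions = (
    [(0.05, 0.05)] * 3 + [(0.05, 0.95)] * 1 +
    [(0.95, 0.05)] * 1 + [(0.95, 0.95)] * 3
)
mu = memory_term(transitions)
```

Here a low-reputation artist is 1.5 times more likely than chance to exhibit next at a bottom-decile venue and half as likely as chance to reach a top-decile one; values above 1 indicate a transition favored by memory.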

To test the role of reputation, we simulated the 
career of each artist in our sample, using as input 
only their first five exhibits and the universal 
(artist-independent) \(\mu[\pi_{i_{t+1}}; m_t]\) functions to decide
where they would exhibit next. The model accurately
captured the lock-in effect observed in real
careers (Fig. 3C). The forecast error saturated
beyond \(\tau_m = 12\) (supplementary text S3.3 and fig.
S8a), indicating that the past 12 exhibitions offered
an optimal memory to capture the role of
reputation in artistic careers. The modeling 
framework did not predict the specific institu- 
tions that exhibit an artist, but only their level of 
prestige (figs. S8, b to h, and S9). This is partly 
because there are many institutions within 
each community, with comparable prestige. 
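
A single step of such a simulation can be sketched as follows: the memoryless transition probabilities are reweighted by a memory function of the target's prestige and the artist's average past reputation, then renormalized. This is a hedged sketch under stated assumptions: the function names `average_reputation` and `next_venue_probabilities`, the step-shaped memory function, and the two-venue example are hypothetical, and only the memory length of 12 exhibits comes from the text.

```python
def average_reputation(prestige_history, memory=12):
    """Mean prestige over the most recent `memory` exhibits (12 per the text)."""
    recent = prestige_history[-memory:]
    return sum(recent) / len(recent)


def next_venue_probabilities(base_probs, prestige, m_t, mu):
    """Reweight memoryless transition probabilities by a memory term.

    base_probs: dict institution -> memoryless probability of moving there.
    prestige:   dict institution -> prestige in [0, 1].
    mu:         callable (pi, m) -> multiplicative weight (assumed given).
    The division by `total` plays the role of the normalization factor K.
    """
    weighted = {j: p * mu(prestige[j], m_t) for j, p in base_probs.items()}
    total = sum(weighted.values())
    return {j: w / total for j, w in weighted.items()}


# Hypothetical two-venue example with a toy memory function that favors
# venues whose prestige is close to the artist's past reputation.
def toy_mu(pi, m):
    return 3.0 if abs(pi - m) < 0.5 else 1.0

m_t = average_reputation([0.9, 0.8, 1.0])
probs = next_venue_probabilities(
    base_probs={"low_venue": 0.5, "high_venue": 0.5},
    prestige={"low_venue": 0.1, "high_venue": 0.9},
    m_t=m_t,
    mu=toy_mu,
)
```

Although the memoryless walk treats both venues equally, the reweighted step sends this high-reputation artist to the high-prestige venue with probability 0.75, which is the kind of lock-in the memory term is meant to capture.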

As Fig. 2F illustrates, 240 artists who began 
their career in low-prestige institutions did break 
through, having the average prestige of their last 
five recorded exhibits in high-prestige institutions.
We find that those who do break through do so
within the first 10 years of their careers (fig. S10a).
We also find that among their first five exhibits, 
breakout artists exhibit in institutions with a 
wider range of rankings, their initial prestige 
standard deviation being 18.6%, compared to 
10.3% for those who did not break through (p = 
10°°?, fig. S10b); they exhibit in more distinct 
institutions, their initial fraction of exhibitions 
in distinct institutions being 70.3%, compared to 
49.3% (p = 107), fig. S10c); have higher maxi- 
mum exhibition prestige (0.60 compared to 0.41, 
p=10 ~, fig. S10d); and their network distance to
MoMA is equal to 0.48, compared to 0.60 (p = 
10-6, fig. S10e). In other words, later access to 
high-prestige institutions is improved by an inten- 
sive early “shopping around.” 
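
The early-career indicators compared above (spread of prestige, share of distinct institutions, and maximum prestige among the first five exhibits) are simple to compute once a career is available as an ordered list. The following Python sketch shows one possible calculation; the function name `early_career_features`, the use of the population standard deviation, and the example career are assumptions, since the source does not specify these details.

```python
from statistics import pstdev


def early_career_features(exhibits, n_first=5):
    """Summarize the first `n_first` exhibits of a career.

    exhibits: chronologically ordered list of (institution, prestige) pairs.
    Assumption: spread is measured with the population standard deviation.
    """
    first = exhibits[:n_first]
    prestiges = [prestige for _, prestige in first]
    return {
        "prestige_std": pstdev(prestiges),
        "distinct_fraction": len({inst for inst, _ in first}) / len(first),
        "max_prestige": max(prestiges),
    }


# Hypothetical career: five exhibits spread over three institutions.
career = [("A", 0.2), ("A", 0.2), ("B", 0.4), ("C", 0.6), ("C", 0.6)]
features = early_career_features(career)
```

For this example, three distinct institutions out of five exhibits give a distinct fraction of 0.6 and a maximum prestige of 0.6.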

Although talent is difficult to measure, we ex- 
pect an artist’s talent to be uncorrelated with 
their country of origin, implying that the distri- 
bution of initial reputation should not vary across 
artists of different origin. However, initial repu- 
tation was not equally distributed across artists 
of different country of origin (Fig. 3G). In many 
countries, artists start and end their career in low-
prestige institutions (Fig. 3H); those, however, 
born in countries with better access to the art 
network have a higher chance of starting and 
ending their career at the top. 

Our analysis focused on art surveyed by 
galleries, museums, or auction houses, so non- 
object-based art, like performance art, was 
underrepresented. We also focused on success 
measures tied to institutional access, ignoring 
multiple dimensions through which art and artists
enrich our society (18). Yet, even with this
limited focus, our results codify the stratification 
of the art world, which limits access of artists 
to institutions that would be beneficial to their 
career. Quantifying these barriers and the mech- 
anism of access could help establish policies to 
level the playing field. For example, the art world 
could benefit from the implementation of lottery 
systems that offer some underrepresented artists 
access to high-prestige venues, or blind selection 
procedures, successfully implemented in classical
music (19), enhancing the inclusion of neglected
works and artists.
MASS SPECTROMETRY


Protein assemblies ejected directly 
from native membranes yield 
complexes for mass spectrometry 


Dror S. Chorev, Lindsay A. Baker, Di Wu, Victoria Beilsten-Edmands,
Sarah L. Rouse, Tzviya Zeev-Ben-Mordehai, Chimari Jiko,
Firdaus Samsudin, Christoph Gerle, Syma Khalid, Alastair G. Stewart,
Stephen J. Matthews, Kay Grünewald, Carol V. Robinson


Membrane proteins reside in lipid bilayers and are typically extracted from this environment
for study, which often compromises their integrity. In this work, we ejected intact
assemblies from membranes, without chemical disruption, and used mass spectrometry
to define their composition. From Escherichia coli outer membranes, we identified
a chaperone-porin association and lipid interactions in the β-barrel assembly machinery.
We observed efflux pumps bridging inner and outer membranes, and from inner membranes
we identified a pentameric pore of TonB, as well as the protein-conducting channel SecYEG
in association with F1Fo adenosine triphosphate (ATP) synthase. Intact mitochondrial
membranes from Bos taurus yielded respiratory complexes and fatty acid-bound dimers
of the ADP (adenosine diphosphate)/ATP translocase (ANT-1). These results highlight
the importance of native membrane environments for retaining small-molecule binding,
subunit interactions, and associated chaperones of the membrane proteome.


Genes encoding membrane proteins constitute
20 to 30% of the genome of all living cells,
and these proteins perform critical processes
ranging from mediating drug resistance in
bacteria to facilitating the complex mitochondrial
respiratory chain in humans. Recent
developments in structural biology, including
high-resolution cryo-electron microscopy (cryo-EM),
are uncovering new structures and roles of
membrane proteins (1). Often, subunit stoichiometry
and lipid binding properties of complexes
extracted in detergent micelles have been con- 
troversial, prompting development of native 
mass spectrometry (nMS) methods. Now broadly 
accepted for retaining the stoichiometry of 
soluble complexes (2), recent developments in 
nMS of membrane protein assemblies have 
not only uncovered subunit stoichiometries but 
have also found roles for lipids in modulating 
structures (3, 4). To reveal stoichiometry and 
lipid binding in the absence of detergents, alter- 
native nMS approaches have been developed to 
analyze bicelles (5), amphipols (6), nanodiscs (7), 
and styrene maleic acid copolymer lipid particles 
(8). All of these approaches require some chem- 
ical intervention and high levels of protein ex- 
pression, thereby restricting their use primarily 
to proteins overexpressed in bacteria. In this study, 
we aimed to overcome these limitations and 
show that we can obtain mass spectra for protein
assemblies ejected directly from native membranes
of prokaryotic and eukaryotic organisms
and uncover many previously uncharacterized 
interactions in the process. 

To develop this approach, we first used mem- 
brane protein-enriched extracellular vesicles 
(MPEEVs) overexpressing the epithelial fusion 
failure protein 1 (EFF-1), reported to be monomeric, 
or the anchor cell fusion failure protein 1 (AFF-1) 
with unknown stoichiometry (9, 10). MPEEVs 
of both proteins from Syrian hamster BHK21 
cultured cells were prepared and characterized 
as described previously (9). The presence of ei- 
ther EFF-1 or AFF-1 increased the diversity of 
cardiolipins (CDLs), as was confirmed by stan- 
dard approaches (fig. S1, A and C). By subjecting 
these vesicles to sonication to destabilize their 
integrity (materials and methods and fig. S2) 
and to high energy across a modified Orbitrap 
MS (11), we released monomeric EFF-1 and intact
dimeric AFF-1 directly from vesicles (fig. S1B). 

Having established the feasibility of our ap- 
proach, we investigated its application to addi- 
tional native membranes. We separated outer 
and inner membranes of Escherichia coli via a 
sucrose gradient, prepared vesicles, and used 
proteomics to identify membrane proteins (12) 
(fig. S3). To interpret the mass spectra, we de- 
veloped and applied a protocol that accounts 
for peak width, collision-induced dissociation
(CID), and accurate mass, only accepting solutions
within ±0.3% of calculated masses (fig.
S2 and tables S1 to S3). Starting from the low-
mass/charge ratio (m/z) range of the spectrum 
recorded for E. coli outer membranes, we assigned 
BamC with a lipid anchor, a component of the
β-barrel assembly machinery (BAM) (Fig. 1A)
(13). Moving to higher m/z, we assigned DnaK,
implicated previously in the assembly of outer
membrane porins (14) and confirmed via CID of
a 143-kDa complex together with OmpA (fig. S4).
Previous reports that DnaK coimmunoprecipitates
with full-length pro-OmpA but not with
pro-OmpA(Δ3) (14) implied that sequences outside
the β barrel are required to maintain accessibility
of DnaK binding sites. Our measured
mass (within 0.10%) is consistent with adenosine
diphosphate (ADP)-bound DnaK binding
to pro-OmpA and associating with a second
OmpA, likely through the C-terminal dimerization
domain (15), to form OmpA:pro-OmpA:
DnaK:ADP (Fig. 1B).
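
The acceptance criterion used for assignments, a measured mass within a fixed relative tolerance of a calculated mass, can be illustrated with a short Python sketch. It is not the authors' protocol, which also accounts for peak width and CID: the function names, the candidate names, and the masses below are hypothetical, and the conversion from m/z to neutral mass assumes positive-mode ions carrying one proton per charge.

```python
PROTON_MASS = 1.007276  # Da, mass of a proton


def neutral_mass(mz, charge):
    """Neutral mass from an observed m/z, assuming [M + zH]z+ ions."""
    return charge * (mz - PROTON_MASS)


def within_tolerance(measured, calculated, rel_tol=0.003):
    """True if the measured mass lies within +/- rel_tol (0.3%) of calculated."""
    return abs(measured - calculated) / calculated <= rel_tol


def accept_assignments(measured, candidates, rel_tol=0.003):
    """Return names of candidate complexes whose calculated mass matches."""
    return [
        name for name, mass in candidates.items()
        if within_tolerance(measured, mass, rel_tol)
    ]


# Hypothetical candidates (masses in Da) and a hypothetical 20+ peak.
candidates = {"complex_X": 100000.0, "complex_Y": 101000.0}
mass_from_peak = neutral_mass(5001.007276, 20)
accepted = accept_assignments(100200.0, candidates)
```

A measured mass of 100,200 Da deviates by 0.2% from `complex_X` and by about 0.8% from `complex_Y`, so only the first candidate passes the 0.3% filter.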

Turning to the high-m/z region of the mass 
spectrum recorded for outer membranes, a pre- 
dominant series of peaks was assigned to BAM 
(13). After detergent extraction and overexpression 
of all five subunits on a single plasmid, structural 
studies yielded primarily a 1:1:1:1:1 stoichiometry 
for Bam subunits (A to E) (13, 16, 17). From native 
membranes, however, a hexamer was ejected 
with a subunit composition of ABCD(E)2 (Fig. 1,
A and D). A small population of this complex
had been observed previously from recombinant
preparations (17). Because a domain-swapped
dimer was observed by x-ray analysis of BamE
alone (18) and nuclear magnetic resonance
(NMR) solution studies were consistent with
a population of BamE dimers (19), we docked
a BamE dimer into the BamABCD complex and
used molecular dynamics (MD) simulation to
test its stability (20). The complex remained
stable for 5 μs, after heating to 323 K, consistent
with its viability in the E. coli outer membrane 
lipid environment. A second series was assigned 
to the pentameric BamABCDE complex, its 
diffuse peaks consistent with binding of up to 
three CDL molecules (Fig. 1C and fig. S5). Pref- 
erential binding of phosphatidylglycerol over 
CDL had been reported previously (19), which 
prompted us to explore lipid binding prefer- 
ences of BamE by means of MD simulations 
(three 5-μs trials) (Fig. 1E and fig. S6). As many
as three CDL lipids made contact, indicating 
CDL attachment through BamE, which likely 
anchors the complex to a region of the mem- 
brane high in CDLs and may contribute to a 
membrane targeting mechanism. 

Inner membrane vesicles prepared from E. coli 
represent a substantial challenge, as together 
Fig. 1. Protein complexes ejected directly from E. coli outer membranes.
(A) Peaks in the mass spectrum recorded at 400 V are assigned to BamC, DnaK,
DnaK:OmpA:pro-OmpA, and two states of the Bam complex. The inset denotes
observed complexes of an outer membrane vesicle. (B) Model of the OmpA
dimer (15). The hydrophobic pro-sequence (red) is a potential binding site
for DnaK. (C) Expansion of the mass spectrum assigned to the Bam complex
[boxed region in (A)], with monomeric BamE (BamABCDE) binding to one, two,
and three cardiolipins (gray, green, and yellow, respectively). (D) Atomic
structure of the BamE dimer (PDB: 2YH9) (orange and blue) docked into the
Bam complex (PDB: 5D0O) with the BamE monomer removed. (E) MD simulations
of the BamABCDE complex (cyan) with monomeric BamE (orange) and two
(left) and three (right) CDL molecules (red).


they contain a minimum of 42 different proteins 
(12) and yield complex spectra for assignment
(fig. S7). We used the heterogeneity of cofactor
binding to first identify cytochrome bo3 and
the CydAB cytochrome bd oxidase complex. 
Peaks corresponding to (CyoB)2(CyoC)2(CyoD)2,
with one or two HemeO3 and HemeB factors and
additional CDL binding (diffuse peaks), imply 
that lipid binding stabilizes structures with the 
full-heme complement, supported by reduced 
charge state (Fig. 2A) and in line with the proposed
dimer association for cytochrome bo3 from
native membranes (21). Extensive peak splitting
attributed to different heme groups (B558 and 
B595) and ubiquinol helped to identify the CydAB 
cytochrome bd oxidase complex (Fig. 2B and fig. 
S8). Both CydX and the paralogous small trans- 
membrane protein AppX have the potential to 
interact with the CydAB complex and have over- 
lapping cellular functions (22). From the native 
membrane, we found that CydX and AppX were 
able to interact simultaneously with CydAB to
form a heterotetramer. 

We next assigned, on the basis of mass, parts 
of the energy-transducing Ton complex located 
within the inner membrane. In the inner mem- 
brane, three integral membrane proteins reside: 
ExbB, ExbD, and TonB. From x-ray crystallography, 
a second copy of ExbD was located within the 
pentameric ExbB pore (23), whereas from EM, 
both hexameric and pentameric assemblies were 
defined (24). Our results confirm the existence 
of only the pentameric pore within the native 
membrane, with measured charge states imply- 
ing trapping of one ExbD protomer within the 
compact globular complex (Fig. 2B). 

At the higher-m/z region, subassemblies of 
multidrug efflux pumps, including AcrAB-TolC 
and the less well characterized but related pump 
MdtAB-TolC (25), spanning both membranes were
uncovered (Fig. 2C). For AcrAB-TolC, all three 
inner membrane subunits (in AcrB) are preserved
and bound to the recently discovered small
subunit (AcrZ), as evidenced by mass spectra;
thought to modulate substrate preference (26);
and modeled into cryo-EM structures (27). One
copy of the outer membrane protein TolC is
bridged by a single copy of the periplasmic subunit
AcrA to the inner membrane complex,
yielding AcrB3:AcrZ2:AcrA:TolC. In the case of
MdtAB-TolC, dimeric MdtB remains assembled
with (MdtA)3 and (TolC)3 in the outer membrane
(MdtB2:MdtA3:TolC3). Because all three
MdtA subunits remain attached, they are likely
supported by dimeric MdtB in the inner membrane,
consistent with the role of MdtC in substrate
binding (28) and not in supporting
periplasmic subunits. During the sonication 
process and MS analysis, AcrABZ-TolC under- 
goes more extensive disassembly than MdtAB- 
TolC, which remains largely intact with charge 
states (fig. S9) indicative of highly charged sub- 
units from AcrABZ-TolC undergoing CID (29). 
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Fig. 2. Regions of the mass spectrum recorded for inner membranes 
from E. coli yield cytochromes, the Ton complex multidrug transporters, 
and the intact ATP synthase in complex with the SecYEG translocon. 

(A and B) Expanded regions of the spectrum assigned to cytochrome 
bo3 and cytochrome bd oxidase, showing peak splitting due to binding of 
quinol and heme groups (fig. S8). The pentameric ExbB complex (with 
one copy of ExbD in the center of the pore) that forms part of the TonB 


The fact that both complexes survive at least in 
part, however, points to new ways of studying 
the effects of antibiotics on the assembly and 
conformational change of these multidrug re- 
sistance pumps. 

At the highest m/z values, we also observed 
peak splitting due to ADP/ATP binding, which, 
together with dissociation of subunits with the 
mass of the c subunit, is indicative of ATP syn- 
thase (Fig. 2C and figs. S10 and S11). Mass dif- 
ferences between populations were assigned to 
binding of SecE, SecY, and SecG, consistent with 
SecYEG remaining in contact with the F}Fo ATP 
synthase, as reported previously for insertion of 
subunit a (30). We next considered the stoichi- 
ometry of the Fo ring. Early reports had suggested 
a variable stoichiometry depending upon meta- 
bolic conditions (37), and our data are consistent 
with 12 c subunits in the Fo ring. We therefore 
conclude that, in the native membrane, inter- 
actions between F,Fo ATPase and SecYEG are 
maintained after insertion in the membrane, 
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together with c subunits in Fo, which are either 
lost during detergent extraction (32) or filtered 
out via other methods. 

Observation of E. coli ATPase prompted us 
to consider inner mitochondrial membranes, 
densely populated with protein complexes re- 
sponsible for control of the proton gradient and 
oxidative phosphorylation between the inter- 
membrane space and the inner mitochondrial 
matrix (33). Surprisingly, inner mitochondrial 
membranes from Bos taurus yielded no sub- 
stantial subassemblies of complex I or complex V 
(Fig. 3A). Other complexes in the respiratory 
chain that were observed include monomeric 
complex III; monomeric complex IV, with lipid 
and cofactor occupancy (34); and dimeric com- 
plex II, with seven core subunits confirmed by 
dissociation of cytochrome b and UQCRB (Fig. 
3A and fig. S12). 

complex is also observed (yellow). (C) High-m/z region of the mass spectrum assigned to multidrug efflux pumps AcrAB and MdtAB and the intact ATP synthase. Expansion of peaks assigned to the ATPase reveals binding of the SecY (blue), SecYG (green), and SecYEG (orange) charge states 52+, 53+, and 54+. Complexes observed in mass spectra are shown schematically, with subunits that have dissociated shown in gray.

Notably, the most abundant protein in the mass
spectrum of the inner mitochondrial membrane
was adenine nucleotide translocase 1 (ANT-1)
(Fig. 3A). The stoichiometry of ANT-1 has remained
controversial, with monomeric structures
of ANT-1 and UCP2 solved by x-ray crystallog- 
raphy and NMR and proposed to be functional 
(35-37). However, in vivo and biochemical ex- 
periments were consistent with the functional 
unit of ANTs and uncoupling proteins (UCPs) 
being dimeric (38), a proposal supported by MD 
simulations on a short time scale (39). MS of 
ANT-1 revealed that it is predominantly dimeric 
with low-occupancy binding of a number of sat- 
urated fatty acids (palmitate anions), indicative 
of a transport mechanism rather than a specific 
binding interaction (Fig. 3A and fig. S13). MD 
simulations in lipid bilayers showed that, within 
10 μs, tightly bound dimers formed predominantly
if CDL was present in the inner leaflet of the 
membrane (fig. S14). Fatty acid binding was 
also observed with the palmitate head group 
buried between two helices in each subunit 
(Tyr'®? and Phe”) (Fig. 3B, fig. S14D, and 
movie S1). In situ binding of multiple palmitate
anions within the dimer ejected from the native
mitochondrial membrane provides direct evidence
to support the role of this fatty acid in the
control of uncoupling through ANT-1-mediated
transport (40).

Because complexes I and V were largely ab- 
sent from the spectra of inner mitochondrial 
membranes, we applied our protocol to intact 
mitochondria, without prior separation of inner 
and outer membranes, reasoning that the outer 
membrane and noninverted protein orientation
may protect complexes exposed during sonication.
The resulting mass spectra again revealed
ANT-1 dimers bound to palmitate (fig. S13), as 
well as lipid-bound subassemblies of complex I 
bound to its flavin mononucleotide (FMN) co- 
factor and lipids, a complex IV dimer (Fig. 3D 
and fig. S15), as well as intact complex V bound to 
nucleotides. 

The release of complexes I and V, only when 
protected by the outer membrane, prompts consideration
of the mechanism of direct ejection
from native membranes. The three steps to consider
are (i) formation of vesicles, (ii) sonication
in ammonium acetate to disrupt vesicles, and 
(iii) mass spectrometry under high electric fields 
from −400 to −700 V. During the first step, vesicles
are formed from membrane preparations
with protein complexes in both orientations. 
For purified inner mitochondrial membranes, 
we observe primarily inside-out vesicles with 
ATPase F1 heads visible in cryo-EM images
(fig. S16). We anticipate that sonication may form defects in vesicles, thus allowing ingress of the ammonium acetate buffer used for electrospray. The third stage, in which high voltage is applied, favors expulsion of complexes charged by the electrospray process and attracted by the high electric field. Pronounced differences are observed between the two preparations from mitochondria. For inner membranes, we attribute the virtual absence of intact mitochondrial ATP synthase to exposure to shearing forces during sonication. Similar arguments could be made for the absence of complex I from inner membranes; its exposed hydrophilic peripheral arm may have sheared during sonication, leaving the hydrophobic membrane-embedded complex less susceptible to charge. By contrast, for intact mitochondria, complexes I and V are ejected largely intact, supporting the hypothesis that exposure to shearing forces during sonication is a key determinant in survival of complexes ejected from native membranes.

... membranes. Subunits shown in gray have ... simulation of ANT-1 after 2.1, 2.5, and 2.6 μs (left ... metric membrane ... the matrix leaflet). The ... from intact mitochondria. Intact complex V is observed with associated nucleotides, together with a dimer of complex IV (fig. S15) and partial assemblies of complex I (the charge state of the subassembly lacking NUFS3 is z = 56+), in the absence of the catalytic core, bound to FMN. Inset schematics ...

Although full details of the mechanism of 
ejection from native membranes into vacuum 
are the subject of ongoing research, the data pre- 
sented here establish a detergent- and chemical- 
free MS approach that overcomes potential 
artifacts introduced by the use of these reagents. 
The number of new interactions of membrane 
proteins uncovered, with lipids, chaperones, 
and cofactors in association, is a testament to 
the stability endowed by the native membrane. 
Notably, access to the protein ensemble of dif- 
ferent membrane compartments, at unparallel- 
ed mass resolution, will enable a new perspective 
on the effects of drugs and disease-associated 
mutations on target complexes within the context
of their native membrane environments.

Small-molecule inhibitor of OGG1
suppresses proinflammatory gene
expression and inflammation

Geoffrey Masuyer, Olga Loseva, Oliver Mortusewicz, Elisée Wiita, Antonio Sarno,

The onset of inflammation is associated with reactive oxygen species and oxidative
damage to macromolecules like 7,8-dihydro-8-oxoguanine (8-oxoG) in DNA. Because 
8-oxoguanine DNA glycosylase 1 (OGG1) binds 8-oxoG and because Ogg1-deficient mice 
are resistant to acute and systemic inflammation, we hypothesized that OGG1 inhibition 
may represent a strategy for the prevention and treatment of inflammation. We developed 
TH5487, a selective active-site inhibitor of OGG1, which hampers OGG1 binding to and 
repair of 8-oxoG and which is well tolerated by mice. TH5487 prevents tumor necrosis
factor-α-induced OGG1-DNA interactions at guanine-rich promoters of proinflammatory
genes. This, in turn, decreases DNA occupancy of nuclear factor κB and proinflammatory
gene expression, resulting in decreased immune cell recruitment to mouse lungs. Thus,
we present a proof of concept that targeting oxidative DNA repair can alleviate
inflammatory conditions in vivo.


Upon exposure to proinflammatory agents, cells produce increased levels of reactive oxygen species (ROS), which induce oxidative DNA damage. Guanine is particularly vulnerable because it has the lowest oxidation potential among canonical DNA bases (1, 2), resulting primarily in 7,8-dihydro-8-oxoguanine (8-oxoG), particularly at guanine-rich promoter regions (3, 4). 8-Oxoguanine DNA glycosylase 1 (OGG1) binds with high affinity to 8-oxoG in double-stranded DNA to initiate DNA base excision repair. In addition to this role, OGG1 has distinct signal transduction functions (5-7), interacts with 8-oxoG in gene regulatory regions, and facilitates gene expression (3, 7-12). These observations provide a potential explanation for the decreased inflammatory responses in Ogg1-deficient (Ogg1−/−) mice (13-16), which are otherwise viable and largely healthy (17). Thus, we hypothesized that small-molecule OGG1 inhibitors may be clinically useful for the alleviation of inflammatory processes while still being well tolerated.

To screen for OGG1 inhibitors, we used a duplex oligonucleotide with the OGG1 substrate 8-oxo-7,8-dihydro-2'-deoxyadenosine and an excess of apurinic or apyrimidinic (AP) endonuclease 1 (APE1), which acts downstream of OGG1 and increases its turnover on damaged DNA (18) (Fig. 1, A and B). We screened a library containing 17,940 compounds (table S1) and identified a hit molecule with a median inhibitory concentration (IC50) of 8.6 μM. During hit expansion, we developed TH5487 as a potent OGG1 inhibitor with an IC50 of 342 nM, whereas structurally similar analogs TH2840 and TH5411 were inactive, with IC50 values exceeding 100 μM (Fig. 1, C and D, and figs. S1 to S3). This compound series was selective for OGG1; did not affect the activity of other DNA glycosylases (fig. S5A and table S2) or various Nudix hydrolases and diphosphatases (table S3); and did not intercalate DNA (fig. S5B). Previously, a hydrazide-based small molecule (O8) was reported to inhibit OGG1 with potency similar to that of TH5487 (19). O8 was found to inhibit catalytic imine formation in OGG1 (19), and we observed an increase in the potency of O8 by omitting APE1 from the reaction, in contrast to TH5487 (table S4). APE1 readily released fluorescence from a natural AP site but only partially from an AP-site substrate preincubated with O8 (fig. S5C). Thus, TH5487 primarily inhibited the DNA glycosylase activity of OGG1, whereas O8 appeared to interfere with downstream β-lyase activity. To further validate OGG1 inhibition by TH5487, we performed electrophoretic mobility shift assays, where OGG1 bound to 8-oxoG:C-containing duplex oligonucleotide in a concentration-dependent manner (fig. S4C). The amount of OGG1-DNA complexes decreased in a dose-dependent manner upon addition of TH5487 (Fig. 1E), demonstrating that TH5487 precludes OGG1 from binding oxidized DNA in vitro.
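
To make the reported potencies concrete, the sketch below evaluates a simple Hill-type inhibition model at the two IC50 values quoted above (8.6 μM for the initial hit and 342 nM for TH5487). This is an illustration only: the source does not state which curve model was fitted, the Hill slope of 1 is an assumption, and the function name `fraction_active` is hypothetical.

```python
def fraction_active(conc_m: float, ic50_m: float, hill: float = 1.0) -> float:
    """Fraction of uninhibited enzyme activity remaining at inhibitor concentration conc_m.

    Uses the Hill-type relation 1 / (1 + (c / IC50)^h). The default slope h = 1
    is an assumption made for illustration, not a value from the report.
    """
    if conc_m < 0 or ic50_m <= 0:
        raise ValueError("concentration must be >= 0 and IC50 must be > 0")
    return 1.0 / (1.0 + (conc_m / ic50_m) ** hill)


HIT_IC50_M = 8.6e-6      # initial screening hit, 8.6 micromolar
TH5487_IC50_M = 342e-9   # optimized inhibitor, 342 nanomolar

# Ratio of the two IC50 values, i.e. the gain in potency during hit expansion.
fold_gain = HIT_IC50_M / TH5487_IC50_M

if __name__ == "__main__":
    print(f"Potency gain from hit to TH5487: about {fold_gain:.0f}-fold")
    for conc in (1e-8, 1e-7, 1e-6, 1e-5):
        hit = fraction_active(conc, HIT_IC50_M)
        lead = fraction_active(conc, TH5487_IC50_M)
        print(f"{conc:.0e} M: hit {hit:.2f}, TH5487 {lead:.2f} of activity remaining")
```

By construction, half of the activity remains when the inhibitor concentration equals the IC50, so the two curves are the same shape shifted along the concentration axis by the fold difference in potency.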

TH5487, but not the inactive analogs TH2840 and TH5411, increased the melting temperature for OGG1 in a concentration-dependent manner (Fig. 1F). Thus, TH5487-mediated protein destabilization did not account for the observed decrease in enzyme activity, suggesting that TH5487 binds OGG1 similarly to 8-oxoG extruded from DNA. Supporting this, treatment with TH5487 resulted in a lower deuteration for all peptides forming the active site cavity (Fig. 1G and table S5). Thus, TH5487 is a potent and selective active site inhibitor that prevents OGG1 from binding to its DNA substrate.

To identify the precise binding site for this 
class of inhibitors, we determined the x-ray crystal 
structure of mouse OGG1 in complex with the
more soluble analog TH5675 (Fig. 1H; figs. S4 
and S6, A to C; and table S6). TH5675 bound the 
active site (fig. S6D), albeit differently from the 
natural substrate (fig. S6E). Notably, the iodo- 
phenyl tail of TH5675 occupied the deeper hydro- 
phobic pocket flanked by Phe®”, Cys**’, and Met” 
and took the place of the 8-oxoguanine base. The 
central piperidyl linker was stabilized by hydro- 
gen bonds with the catalytic Lys**® and the 
backbone of Gly*’, the residue that distinguishes 
8-oxoguanine from guanine. The benzimidazolone 
core interacted with a lipophilic exosite, stabilized 
by Ile” and Leu®”’ in addition to a π-stacking interaction
with His”” (20). Notably, the Asp*”” side
chain was within hydrogen-bond distance of the 
solvent accessible amine, which corresponds 
to the bromine atom in TH5487 (fig. S6F). These interactions were the result of a local conformational change in which the active site closed around the ligand (fig. S6G and movie S1).

Fig. 1. Development and validation of OGG1 inhibitors. (A) A fluorophore and a quencher on opposite strands are separated upon OGG1-mediated excision of 8-oxoA and APE1 incision at the resulting apurinic site, causing a local melting of the DNA helix. (B) Excision of 8-oxoA:C, but not undamaged substrates, by OGG1 in the presence of APE1. Data are presented as averages ± SD of three technical replicates from four independent experiments. A.U., arbitrary units. (C) Chemical structures of the OGG1 inhibitors described herein. (D) Inhibition curves. 0.8 nM OGG1 and 2 nM APE1 were incubated with 10 nM OGG1 substrate and different concentrations of the indicated compounds. Data are presented as averages of four technical replicates from at least two independent experiments (n = 2 for TH2840 and TH5411, n = 33 for TH5487). (E) TH5487 precludes binding of OGG1 to damaged DNA. Ten nM of an OGG1-substrate duplex oligonucleotide was incubated with 100 nM OGG1 and the indicated concentrations of TH5487. This prevented the formation of OGG1-DNA complexes in a dose-dependent manner. The figure is representative of three independent experiments. (F) Differential scanning fluorimetry. OGG1 was incubated with SYPRO Orange and a dilution series of OGG1 inhibitors. TH5487, but not inactive analogs TH2840 and TH5411, confers the thermal stabilization of OGG1. Data are presented as averages ± SD of three technical replicates from three independent experiments. Tm, melting temperature. (G) Differences in deuterium uptake superimposed on an OGG1 model upon TH5487 binding (Protein Data Bank 1EBM). Colored regions show peptides protected from deuterium exchange. The molecular surface of TH5487 is shown as a semitransparent surface, and DNA is displayed as a ribbon. (H) X-ray ... mouse OGG1 (gray) in complex with ligand (yellow). N and C termini are labeled. On the right is a close-up view of ligand ... amino acid residues are marked; hydrogen-bond ... with black dashed lines. The view in (H) differs ...

[Plot labels: (B) G:C or 8-oxoA:C substrate with APE1, without OGG1 or with 1 nM or 10 nM OGG1; y axis, fluorescence (A.U.). (D) x axis, log concentration (M).]

16 November 2018

For OGG1 inhibitors to be pharmacologically useful, they need to engage and inhibit OGG1 in cells. TH5487 increased the melting temperature of OGG1 in human cells (Fig. 2A), demonstrating that TH5487 engaged its intended target in living cells and protected it from thermal denaturation. Furthermore, TH5487 impaired repair of genomic 8-oxoG induced by KBrO3. TH5487 caused a significant increase in genomic 8-oxoG after 2.5 hours (Fig. 2, B and C), and at 24 hours, 50 ± 8% of the 8-oxoG remained in the TH5487-treated cells (Fig. 2C), without disrupting proliferation (fig. S7A). Thus, genomic 8-oxoG and TH5487 were well tolerated by cells. Furthermore, the decrease in genomic 8-oxoG was a result of repair processes and not cellular replication. To further validate target engagement, we assessed the chromatin dynamics of OGG1-GFP (green fluorescent protein) fusion proteins. Cells were treated with KBrO3 and released into medium containing TH5487 or dimethyl sulfoxide (DMSO). Consistent with previous reports (21), OGG1-GFP fusion proteins were immobilized at genomic DNA lesions introduced by KBrO3. Treatment with TH5487 increased the nuclear mobility of OGG1-GFP both 3 and 5 hours after KBrO3 exposure (Fig. 2, D and E, and fig. S7, B and C), suggesting that TH5487 prevented OGG1 binding to its genomic substrate in living cells.

OGG1 binds 8-oxoG at gene regulatory regions to mediate transcriptional activation in response to inflammatory stimuli (3, 7-12). In the absence of functional OGG1, a decreased inflammatory response is observed (3, 12-16, 22). Because TH5487
prevents OGG1 from binding 8-oxoG in DNA, we examined if TH5487 could suppress proinflammatory gene expression. In line with previous observations (12), human embryonic kidney (HEK) 293T cells lacking OGG1 displayed a reduced induction of CXCL1 [chemokine (C-X-C motif) ligand 1] mRNA after tumor necrosis factor-α (TNFα) stimulation (Fig. 2F and fig. S7, D and E). Treatment with 5 μM TH5487 decreased CXCL1 expression by >50% in wild-type but not in OGG1-knockout cells (Fig. 2F). Thus, the compound may be used to specifically inhibit OGG1-dependent proinflammatory gene expression. Because respiratory epithelium is a key orchestrator of pulmonary innate immune responses (23), we stimulated a murine airway epithelial cell line (MLE 12) with TNFα (24), which increased the expression of an array of proinflammatory cytokines as well as C-C and C-X-C chemokines (Fig. 3, A to C, and fig. S8). Importantly, TH5487 decreased the expression of the same genes to near pretreatment levels (Fig. 3, C and E, and figs. S8 to S12). Inhibition was dose-dependent (Fig. 3D and fig. S10) and also observed with the potent inflammatory agent lipopolysaccharide (LPS) (25) (Fig. 3F and figs. S11 and S12). Crucially, TH5487 decreased TNFα- and LPS-induced gene expression in diploid human small-airway epithelial cells (hSAECs) as well (Fig. 3, G to I, and figs. S9, S10, and S12).
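
Relative mRNA changes of the kind described above are commonly derived from qPCR threshold cycles by the comparative Ct (2^-ΔΔCt) calculation. The sketch below shows that arithmetic under stated assumptions: the source reports only that qPCR and the manufacturer's web-based software were used, so this is not necessarily the calculation applied in the study, the Ct values are invented for illustration, and the function name `fold_change` is hypothetical.

```python
def fold_change(
    ct_target_treated: float,
    ct_ref_treated: float,
    ct_target_control: float,
    ct_ref_control: float,
) -> float:
    """Relative expression of a target gene (treated versus control) by 2^-ddCt.

    Each target Ct is first normalized to a reference (housekeeping) gene in the
    same sample; the difference between those normalized values is then converted
    to a fold change, assuming the amplicon doubles every cycle.
    """
    delta_treated = ct_target_treated - ct_ref_treated
    delta_control = ct_target_control - ct_ref_control
    return 2.0 ** -(delta_treated - delta_control)


if __name__ == "__main__":
    # Hypothetical Ct values, chosen only to illustrate the calculation.
    ratio = fold_change(
        ct_target_treated=24.0,
        ct_ref_treated=18.0,
        ct_target_control=23.0,
        ct_ref_control=18.0,
    )
    print(f"Target expression in treated cells is {ratio:.2f} times the control level")
```

With these invented inputs the target signal appears one cycle later in the treated sample after normalization, which corresponds to a halving of expression.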

ROS generate a localized increase in OGG1 substrates in guanine-rich promoter regions (4, 6, 9, 10), including proinflammatory genes (3, 4, 12). Emerging evidence suggests that OGG1 binding to gene regulatory regions exerts an epigenetic role for 8-oxoG, causing OGG1 to act as a modulator of gene expression (3, 4, 6-17). Guanine oxidation leads to sequential recruitment of OGG1 and downstream transcriptional effectors (3, 8-11), such as nuclear factor κB (NF-κB), which is the main driver of both TNFα- and LPS-induced proinflammatory gene expression (26). Consistent with the observation that TH5487 prevents OGG1 from engaging damaged DNA in vitro and in cells (Figs. 1E and 2, D and E), we observed that TH5487 decreased the recruitment of OGG1 to regulatory regions of proinflammatory cytokines in TNFα-challenged cells (Fig. 3J). Consequently, TH5487 significantly decreased binding of NF-κB to the same regulatory regions in the chromatin of TNFα-exposed cells (Fig. 3K) and to its recognition sequence in nuclear extracts from mouse and human cells (Fig. 3L and fig. S13, A to C). In the presence of OGG1, TH5487 decreased NF-κB occupancy on 8-oxoG-containing DNA, whereas TH5487 alone was unable to inhibit NF-κB (fig. S13, D and E). Thus, TH5487 decreases proinflammatory gene expression by perturbing DNA occupancy of NF-κB and potentially other OGG1-dependent transacting factors (3, 8-11). TH5487 had no effect on the release of NF-κB from its inhibitory complex (fig. S14, A to C) but inhibited inflammatory gene expression similarly to BMS-345541, an IκB kinase inhibitor (27) (fig. S15A). Thus, both TH5487 and BMS-345541 inhibit NF-κB function: TH5487 by preventing NF-κB binding to promoters (Fig. 3K) and BMS-345541 by inhibiting NF-κB activation (fig. S14, A to C). This results in the same readout in the form of diminished induction of proinflammatory genes. The previously developed OGG1 inhibitor O8 (19) did not affect gene expression (fig. S15A), possibly because, in contrast to TH5487, it allows OGG1 binding to damaged DNA (19) (fig. S15, B and C).

In addition, TH5487 is metabolically relatively stable and well tolerated in mice (fig. S16A and tables S7 to S10). To assess whether TH5487 could down-regulate chemotactic (C-C and C-X-C) mediators (28) in vivo, we challenged mouse lungs with TNFα and profiled the gene expression of proinflammatory mediators. TNFα robustly induced the expression of pulmonary proinflammatory genes, but a prophylactic injection of TH5487 decreased the expression levels (Fig. 4, A and B). Challenge with TNFα or LPS induced the robust recruitment of neutrophils to the airways, which was decreased by up to 85 ± 5% by the prophylactic intraperitoneal administration of TH5487 (Fig. 4C and fig. S16, B to G). We then administered TH5487 at different time points before or after challenge with TNFα and found that TH5487 reduced the pulmonary neutrophil count even when administered up to 9 hours after TNFα challenge (Fig. 4D and fig. S17). Thus, TH5487 is efficacious in vivo, suggesting that the compound could be used for the treatment of inflammatory conditions. Finally, another potent and structurally distinct OGG1 inhibitor was recently published (29). When tested, this compound had comparable anti-inflammatory effects (fig. S18).

Fig. 2. TH5487 engages OGG1 in cells, inhibits DNA repair, and alters OGG1 chromatin dynamics. (A) Cellular thermal shift assay. Jurkat A3 cells were treated with 10 μM TH5487, and OGG1 thermal stability was analyzed by immunoblotting. Addition of 10 μM TH5487 to cultured cells increased the melting point of OGG1 by 3°C (n = 2 independent experiments). Actin was used as a loading control. (B) Induction of genomic 8-oxoG by KBrO3. Duplicate cultures of Jurkat A3 cells were treated for 1 hour with 20 mM KBrO3, and the amount of 8-oxoG in genomic DNA was determined by liquid chromatography-tandem mass spectrometry (LC-MS/MS). KBrO3 induced a >10-fold increase in genomic 8-oxoG. Data are presented as averages ± SD of four replicates from two independent experiments. dG, deoxyguanosine. (C) Repair kinetics of genomic 8-oxoG. Cells treated with 20 mM KBrO3 for 1 hour were washed and released into medium containing 10 μM TH5487 or 0.1% DMSO. Duplicate samples were taken at the indicated time points, and the genomic content of 8-oxoG was determined as in (B). TH5487 induced a notable delay in repair kinetics at 2.5-, 5- and 24-hour time points. Data are presented as averages ± SD of four replicates from two independent experiments. (D) Fluorescence recovery after photobleaching (FRAP). Jurkat A3 cells expressing OGG1-GFP were treated with 16 mM KBrO3, washed, and released into medium with 10 μM TH5487 or 0.1% DMSO. A nuclear region was bleached, and recovery of fluorescence after photobleaching was recorded. Representative false-color images of DMSO- and TH5487-treated cells are shown. Dashed outlines indicate bleached areas. Scale bar, 5 μm. (E) Quantification of FRAP experiments. Ten μM TH5487 increased the nuclear mobility of OGG1-GFP at 3 and 5 hours after KBrO3 treatment. Quantifications of two (0-hour) or three (3- and 5-hour) independent experiments are shown. RFU, relative fluorescence units. (F) TH5487 inhibits TNFα-induced CXCL1 gene expression in wild-type, but not in OGG1-knockout (KO), HEK293T cells. Cells were treated with 0.05% DMSO or 5 μM TH5487 for 1 hour and TNFα (20 ng/ml) for 30 min. CXCL1 mRNA levels were determined with quantitative polymerase chain reaction (qPCR). Data are presented as averages ± SD from three independent experiments. For (B), (C), and (F), **P < 0.01, ***P < 0.001, ****P < 0.0001, and NS is not significant, using unpaired two-sided Student's t test.

[Plot labels: (A) T (°C). (B) 8-oxoG relative to dG; NT versus KBrO3. (E) x axis, time (s); DMSO versus TH5487 at 0, 3, and 5 hours.]

Thus, we have developed a pharmacologically useful OGG1 inhibitor that is a potent and selective active site binder that prevents OGG1 from engaging damaged DNA in vitro and in cells, resulting in decreased proinflammatory gene expression by a mechanism that is distinct from other established therapeutic agents (fig. S19). This is translated into a reduced neutrophil infiltration in mouse lungs challenged with TNFα or LPS, demonstrating that OGG1 inhibition may be a potentially useful strategy for the treatment of inflammation.

Fig. 3. Inhibition of proinflammatory gene expression and inflammation by TH5487, an active site binder of OGG1. (A to C) The effect of TH5487 on basal (A) and TNFα-induced expression of an array of proinflammatory cytokines, chemokines, and receptors [(B) and (C)] in mouse airway epithelial cells (MLE 12). Data analyses were performed according to the manufacturer's instructions using their web-based software package (www.qiagen.com/us/shop/genes-and-pathways/data-analysis-center-overview-page/). (D) The dose-dependent inhibition of TNFα-induced Tnf mRNA levels by TH5487 in MLE 12. (E and F) TH5487 inhibits the TNFα- (E) and LPS-induced (F) expression of proinflammatory genes in MLE 12. (G) Dose-dependent inhibition by TH5487 of TNFα-induced expression of TNF in hSAECs. (H and I) TH5487 inhibits TNFα- (H) or LPS-induced (I) expression of proinflammatory genes in hSAECs. In (A), (B), (C), (E), (F), (H), and (I), parallel cultures of cells were treated with solvent or TH5487 (5 μM) for 1 hour and TNFα (20 ng/ml for 30 min) or LPS (100 ng/ml for 1 hour) was added. In (D) and (G), decreasing concentrations of TH5487 were added before TNFα (20 ng/ml for 30 min). Changes in mRNA levels were determined by quantitative real-time PCR. Data are presented as averages ± SD from at least three independent experiments. (J) TH5487 decreases binding of OGG1 to promoters in chromatin. (K) TH5487 perturbs DNA occupancy of NF-κB in chromatin. In (J) and (K), data are presented as averages ± SD from four independent experiments, and MLE 12 cells were treated with solvent or 5 μM TH5487 for 1 hour and exposed to TNFα (20 ng/ml) for 30 min. Chromatin was immunoprecipitated using antibody to epitope-tagged OGG1, or the p65 subunit of NF-κB. Fold changes in OGG1 and NF-κB binding to the indicated proximal promoter regions were determined by qPCR. (L) TH5487 perturbs binding of NF-κB to 8-oxoG-containing synthetic DNA in nuclear extracts from MLE 12 or hSAEC cells. p50-p65, heterodimer of NF-κB; p50-p50, homodimer of NF-κB. Images are representative of three independent experiments. In (D) to (I), *P < 0.05, **P < 0.01, ***P < 0.001, and NS is not significant, using unpaired two-sided Student's t test.

Fig. 4. TH5487 suppresses proinflammatory gene expression and lung inflammation in mice. (A and B) Groups of mice were treated intraperitoneally with TH5487 (30 mg/kg) or vehicle, and lungs were TNFα-challenged intranasally (20 ng/ml). The bars represent expression levels of mRNAs pooled from lungs of six individual mice. Target gene signals were normalized to housekeeping genes, and all data analyses were performed according to the manufacturer's instructions using their web-based software package (www.qiagen.com/us/shop/genes-and-pathways/data-analysis-center-overview-page/) (n = 1 experiment). (C) Dose-dependent inhibition of TNFα-induced neutrophil infiltration by TH5487. Mice (50% female and 50% male) were treated intraperitoneally with increasing doses of TH5487 and challenged intranasally with TNFα. Sixteen hours after challenge, mice were ... (C). In (C) and (D), ***P < 0.001, using unpaired two-sided Student's t test.

BIOTECHNOLOGY


Programmed DNA destruction by
miniature CRISPR-Cas14 enzymes


Lucas B. Harrington, David Burstein, Janice S. Chen, David Paez-Espino,
Enbo Ma, Isaac P. Witte, Joshua C. Cofsky, Nikos C. Kyrpides,
Jillian F. Banfield, Jennifer A. Doudna


CRISPR-Cas systems provide microbes with adaptive immunity to infectious nucleic acids 
and are widely employed as genome editing tools. These tools use RNA-guided Cas 
proteins whose large size (950 to 1400 amino acids) has been considered essential to their 
specific DNA- or RNA-targeting activities. Here we present a set of CRISPR-Cas systems 
from uncultivated archaea that contain Cas14, a family of exceptionally compact RNA- 
guided nucleases (400 to 700 amino acids). Despite their small size, Cas14 proteins are 
capable of targeted single-stranded DNA (ssDNA) cleavage without restrictive sequence 
requirements. Moreover, target recognition by Cas14 triggers nonspecific cutting of ssDNA 
molecules, an activity that enables high-fidelity single-nucleotide polymorphism 
genotyping (Cas14-DETECTR). Metagenomic data show that multiple CRISPR-Cas14 
systems evolved independently and suggest a potential evolutionary origin of
single-effector CRISPR-based adaptive immunity.


Competition between microbes and viruses stimulated the evolution of CRISPR-based adaptive immunity to provide protection against infectious agents (1, 2). In class 2 CRISPR-Cas systems, a single 100- to 200-kDa CRISPR-associated (Cas) protein with multiple functional domains carries out RNA-guided binding and cutting of DNA or RNA substrates (3, 4). To determine whether simpler, smaller RNA-guided proteins occur in nature, we queried terabase-scale metagenomic datasets (5-9) for uncharacterized genes proximal to both a CRISPR array and cas1, the gene that encodes the universal CRISPR integrase (10, 11). This analysis identified a diverse family of CRISPR-Cas systems that contain cas1, cas2, cas4, and a previously unrecognized gene, cas14, encoding a 40- to 70-kDa polypeptide (Fig. 1A). We initially identified 24 different cas14 gene variants that cluster into three subgroups (Cas14a, Cas14b, and Cas14c) on the basis of comparative sequence analysis (Fig. 1, A and B, and figs. S1 and S2). Cas14 proteins are ~400 to 700 amino acids (aa), about half the size of previously known class 2 CRISPR RNA-guided enzymes (950 to 1400 aa) (Fig. 1, C and D). Though the identified Cas14 proteins exhibit considerable sequence diversity, all are united by the presence of a predicted RuvC nuclease domain, whose organization is characteristic of type V CRISPR-Cas DNA-targeting enzymes (Fig. 1D) (3, 12, 13). The Cas14 proteins we identified occur almost exclusively within DPANN, a superphylum of symbiotic archaea characterized by small cell and genome sizes (14, 15). Phylogenetic comparisons showed that Cas14 proteins are widely diverse with similarities to C2c10 and C2c9, families of bacterial RuvC domain-containing proteins that are sometimes found near a CRISPR array but not together with other cas genes (Fig. 1B and fig. S1) (3). This observation and the small size of the c2c10, c2c9, and cas14 genes made it improbable that these systems could function as stand-alone CRISPR effectors (3).
On the basis of their proximity to conserved 
genes responsible for creating genetic memory 


A Type V CRISPR 


Casi2a 
(Cpf1) 


Cas12d 
(CasY) 


Cas12c 
(C2c3) 


Cas14a 
(6 variants) 


Casi2e~ Cas14b 2 
(CasXx) (16 variants) & 
4 = 

Casi2b ° Cas14c (2 variants) 2 
(C2c1) _ Miniature s 


~ CRISPR systems 


of infection (casI, cas2, cas4) (fig. S3A), we explored 
whether CRISPR-Cas14 systems can actively ac- 
quire DNA sequences into their CRISPR arrays. 
Assembled metagenomic contiguous DNA se- 
quences (contigs) for multiple CRISPR-Cas14 
loci revealed that otherwise identical CRISPR 
systems showed diversity in their CRISPR arrays. 
These results are consistent with active adapta- 
tion to new infections, although without longi- 
tudinal sampling these data could also be explained 
by alternative biological mechanisms (Fig. 2A and 
fig. S3B) (13). The evidence suggesting acquisition 
of new DNA sequences led us to hypothesize 
that these CRISPR-Cas14 loci encode functional 
enzymes with nucleic acid targeting activity de- 
spite their small size. To test this possibility, we 
first investigated whether RNA components are 
produced from CRISPR-Cas14 loci. Environmental 
metatranscriptomic sequencing data were ana- 
lyzed for the presence of RNA from the native 
archaeal host that contains CRISPR-Cas14a (Fig. 
2B and fig. S4A). In addition to CRISPR RNAs 
(crRNAs), a highly abundant noncoding RNA 
was mapped to a ~130-base pair sequence located 
between cas14a and the adjacent CRISPR array.
Notably, the 3’ end of this transcript was mostly 
complementary to the repeat segment of the 
crRNA (Fig. 2C and fig. S4B), as observed for 
trans-activating CRISPR RNAs (tracrRNAs) found 
in association with Cas9, Cas12b, and Cas12e 
CRISPR systems (12, 13, 16). In these previously 
studied systems, the double-stranded RNA-cutting 
enzyme ribonuclease III (RNase III) generates 
mature tracrRNAs and crRNAs, but no genes 
encoding RNase III were present in cas14- 
containing reconstructed genomes (fig. S5A), nor 
did Cas14a cleave its own pre-crRNA when tested 
biochemically (fig. S5B). These observations imply 
that an alternative mechanism for CRISPR- 
associated RNA processing exists in these hosts. 

To test whether the Cas14a proteins and associated
RNA components can assemble together
in a heterologous organism, we introduced a 
plasmid into Escherichia coli containing a mini- 
mal CRISPR-Cas14a locus that includes the cas14 
gene, the CRISPR array, and intergenic regions 


Fig. 1. Architecture and phylogeny of CRISPR-Cas14 genomic loci. (A) Phylogenetic tree of
type V CRISPR systems. Newly identified miniature CRISPR systems are highlighted in orange.
(B) Representative loci architectures for C2c10 and CRISPR-Cas14 systems. (C) Length distribution
of Cas14a to Cas14c systems compared with Cas12a to Cas12e and Cas9. (D) Domain organization
of Cas14a compared with Cas9 and Cas12a. Nuclease domains (RuvC and HNH) are indicated. Protein
lengths are drawn to scale.

Fig. 2. CRISPR-Cas14a actively adapts and encodes a tracrRNA. (A) Spacer diversity for Cas14b4
and Cas14b14 with CRISPR repeats diagramed in tan and distinct spacers shown in different colors.
(B) Metatranscriptomic reads mapped to Cas14a1 and Cas14a3 loci. The insets show an expanded view
of the most abundant repeat and spacer sequence. nt, nucleotides. (C) In silico predicted structure of
Cas14a1 crRNA and tracrRNA. Notably, RNase III orthologs were not identified in host genomes
(fig. S5A). (D) Fraction of various CRISPR complex masses made up of RNA and protein.


containing the putative tracrRNA. Affinity purification
of the Cas14a protein from cell lysate and sequencing
of copurifying RNA revealed a highly abundant mature
crRNA as well as the putative tracrRNA, albeit in lower
relative abundance than what was shown by environmental
metatranscriptomics, suggesting that Cas14 associates
with both crRNA and tracrRNA (fig. S5C). The
calculated mass of the assembled Cas14a protein-tracrRNA-crRNA
particle is 48% RNA by weight
compared with just 17% for Streptococcus pyogenes
Cas9 (SpCas9) and 8% for Francisella novicida
Cas12a (FnCas12a) (Fig. 2D), hinting at a central
role of the RNA in the architecture of the Cas14a
complex. Known class 2 CRISPR systems require
a short sequence called a protospacer adjacent
motif (PAM) to target double-stranded DNA
(dsDNA) (17). To test whether Cas14a requires a
PAM and can conduct dsDNA interference, we
transformed E. coli expressing a minimal Cas14a
locus with a dsDNA plasmid containing a randomized
PAM region next to a sequence matching
the target-encoding sequence (spacer) in the Cas14a
array. Notably, no depletion of a PAM sequence
was detected among E. coli transformants, suggesting
that the CRISPR-Cas14a system is unable
to target dsDNA, can do so without requiring a
PAM, or is inactive in this heterologous host
(fig. S6, A and B).
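
The RNA-by-weight comparison quoted above (48% for the Cas14a particle versus 17% for SpCas9) can be approximated from sequence lengths alone. The following is a minimal sketch, not the authors' calculation: the average residue masses are rough textbook values, and the function name and the example lengths are assumptions chosen only for illustration.

```python
# Approximate average residue masses in daltons (assumed textbook values,
# not taken from the paper).
AVG_AA_DA = 110.0   # one amino acid residue in a protein
AVG_NT_DA = 330.0   # one nucleotide residue in RNA (rough estimate)


def rna_mass_fraction(protein_aa: int, rna_nt: int) -> float:
    """Return the fraction of a ribonucleoprotein's mass contributed by RNA."""
    if protein_aa < 0 or rna_nt < 0:
        raise ValueError("lengths must be non-negative")
    protein_da = protein_aa * AVG_AA_DA
    rna_da = rna_nt * AVG_NT_DA
    total = protein_da + rna_da
    if total == 0:
        raise ValueError("complex has no mass")
    return rna_da / total


if __name__ == "__main__":
    # Hypothetical inputs: a 500-aa protein with 170 nt of guide RNA versus
    # a 1,400-aa protein with 100 nt of guide RNA.
    for label, aa, nt in [("small effector", 500, 170), ("large effector", 1400, 100)]:
        print(f"{label}: {rna_mass_fraction(aa, nt):.0%} RNA by weight")
```

Because the protein term dominates for large effectors, halving the protein length while lengthening the RNA shifts the balance sharply toward RNA, which is the trend the text describes.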

We next tested whether purified Cas14a-tracrRNA-crRNA
complexes are capable of RNA-guided
nucleic acid cleavage in vitro. All currently
reconstituted DNA-targeting class 2 interference
complexes are able to recognize both dsDNA and
single-stranded DNA (ssDNA) substrates (18-20).
We incubated purified Cas14a-tracrRNA-crRNA
complexes with radiolabeled target oligonucleotides
(ssDNA, dsDNA, and ssRNA) bearing a 20-nucleotide
sequence complementary to the crRNA
guide sequence or a noncomplementary ssDNA,
and we analyzed these substrates for Cas14a-mediated
cleavage. Only in the presence of a
complementary ssDNA substrate was any cleavage
product detected (Fig. 3A and fig. S7, A to C),
and cleavage was dependent on the presence of
both tracrRNA and crRNA, which could also be
combined into a single-guide RNA (sgRNA) (Fig.
3B and fig. S8). The lack of detectable dsDNA
cleavage suggests that Cas14a targets ssDNA selectively,
although it is possible that some other
host factor or sequence requirement could enable
dsDNA recognition in the native host. Mutation of
the conserved active site residues in the Cas14a
RuvC domain eliminated cleavage activity (fig. S7,
D and E), implicating RuvC as the domain responsible
for DNA cutting. Moreover, Cas14a
DNA cleavage was sensitive to truncation of the
RNA components to lengths shorter than the naturally
produced sequences (fig. S9, A to D). These
results establish Cas14a as the smallest class 2
CRISPR effector demonstrated to conduct programmable
RNA-guided DNA cleavage thus far.

Although we were unable to identify a dsDNA
PAM in vivo, we tested whether Cas14a requires
a PAM for ssDNA cleavage in vitro by tiling Cas14a
guides across a ssDNA substrate (Fig. 3C). Despite
sequence variation adjacent to the targets of these
different guides, we observed cleavage for all
four sequences. Notably, the cleavage sites occur
beyond the guide-complementary region of the
ssDNA and shift in response to guide binding position
(Fig. 3C). These data demonstrate that Cas14a
is a ssDNA-targeting CRISPR endonuclease that
does not require a PAM for activation.
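
The guide-tiling experiment described above can be illustrated with a short sketch that picks evenly spaced 20-nucleotide windows on an ssDNA substrate and derives the complementary RNA spacer for each. The function names, the even spacing, and the example substrate are assumptions for illustration; the text does not specify how the four guide positions were chosen.

```python
# Watson-Crick partner of each DNA base, written as an RNA base.
DNA_TO_PAIRED_RNA = {"A": "U", "C": "G", "G": "C", "T": "A"}


def guide_for_target(target_dna: str) -> str:
    """Return the RNA spacer (5'->3') that base-pairs with a 5'->3' ssDNA target."""
    return "".join(DNA_TO_PAIRED_RNA[base] for base in reversed(target_dna.upper()))


def tile_guides(substrate: str, guide_len: int = 20, n_guides: int = 4):
    """Return (start, spacer) pairs for n_guides windows spread across the substrate."""
    last_start = len(substrate) - guide_len
    if last_start < 0:
        raise ValueError("substrate is shorter than the guide length")
    if n_guides < 1:
        raise ValueError("need at least one guide")
    if n_guides == 1:
        starts = [0]
    else:
        starts = [round(i * last_start / (n_guides - 1)) for i in range(n_guides)]
    return [(s, guide_for_target(substrate[s:s + guide_len])) for s in starts]


if __name__ == "__main__":
    # Hypothetical 50-nt substrate, not a sequence from the study.
    substrate = "ATGCGTACCTGAAGCTTGCATGCCTGCAGGTCGACTCTAGAGGATCCCCG"
    for start, spacer in tile_guides(substrate):
        print(start, spacer)
```

Each window is flanked by different bases, so cleavage at every tiled position is what one would expect if no fixed adjacent motif is required.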

On the basis of the observation that Cas14a
cuts outside of the crRNA/DNA targeting heteroduplex,
we hypothesized that Cas14a might possess
target-activated nonspecific ssDNA cleavage activity,
similar to the RuvC-containing enzyme Cas12a
(20, 21). To test this possibility, we incubated
Cas14a-tracrRNA-crRNA with a complementary
activator DNA and an aliquot of M13 bacteriophage
ssDNA bearing no sequence complementarity
to the Cas14a crRNA or activator (Fig. 3D).
The M13 ssDNA was rapidly degraded to small
fragments, an activity that was eliminated by
mutation of the conserved Cas14a RuvC active
site, suggesting that activation of Cas14a results
in nonspecific ssDNA degradation. However, we
were unable to observe Cas14a-mediated interference
against the ssDNA bacteriophage ΦX174
when we expressed Cas14a heterologously in
E. coli (fig. S10, A to C), possibly due to the dissimilarity
between E. coli and Cas14a’s native
archaeal host. To investigate the specificity of
target-dependent nonspecific DNA cutting activity
by Cas14a, we adapted a fluorophore-quencher
(FQ) assay in which cleavage of dye-labeled
ssDNA generates a fluorescent signal (Fig. 4A)
(22). When Cas14a was incubated with various
guide RNA-target ssDNA pairs, a fluorescent signal
was observed only in the presence of the
cognate target and showed strong preference
for longer FQ-containing substrates (fig. S10D
and Fig. 4A). We next tested Cas14a mismatch
tolerance by tiling 2-nucleotide mismatches across
the targeted region in various ssDNA substrates.
Surprisingly, mismatches near the middle of the
ssDNA target strongly inhibited Cas14a activity,
revealing an internal seed sequence that is
distinct from the PAM-proximal seed region observed
for dsDNA-targeting CRISPR-Cas systems
(Fig. 4B and fig. S11, A to D). Moreover, DNA
substrates containing strong secondary structure
resulted in reduced activation of Cas14a (fig. S11E).
Truncation of ssDNA substrates also resulted in
reduced or undetectable trans cleavage (fig. S11F).
Together, these results suggest a mechanism of
fidelity distinct from dsDNA-targeting class 2
CRISPR systems, possibly using a mechanism similar
to the ssRNA-targeting Cas13a enzymes (23-25).
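
The mismatch-tiling design described above (2-nucleotide mismatches stepped across the targeted region) can be sketched as follows. This is an illustrative construction only: the choice to swap each base for its complement, the non-overlapping step, and the function name are assumptions, since the text does not say which substitutions were used.

```python
# Substituting each base with its complement guarantees that the mutated
# position no longer pairs with the original guide (an assumed scheme).
SWAP = {"A": "T", "T": "A", "C": "G", "G": "C"}


def tile_mismatches(target: str, width: int = 2):
    """Return (position, variant) pairs, each with `width` adjacent bases altered."""
    target = target.upper()
    if width < 1 or width > len(target):
        raise ValueError("width must be between 1 and the target length")
    variants = []
    # Step by `width` so the mismatch windows do not overlap.
    for pos in range(0, len(target) - width + 1, width):
        mutated = "".join(SWAP[base] for base in target[pos:pos + width])
        variants.append((pos, target[:pos] + mutated + target[pos + width:]))
    return variants


if __name__ == "__main__":
    # Hypothetical 20-nt target, not a sequence from the study.
    for pos, variant in tile_mismatches("GATTACAGATTACAGATTAC"):
        print(f"{pos:2d} {variant}")
```

Measuring activity for each variant and plotting it against `pos` is one way to see where mismatches are least tolerated, which is how an internal seed region would show up.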

Fig. 3. CRISPR-Cas14a is an RNA-guided DNA endonuclease.
(A) Cleavage kinetics of Cas14a1 targeting ssDNA, dsDNA, ssRNA, and off-target
(D) Cleavage of the ssDNA viral M13 genome with activated Cas14a1.
with a blue-eye targeting guide for
brown-eyed individuals compared
with ssDNA detection using Cas12a.


The target-dependent, nonspecific DNase activity
of Cas12a serves as a DNA detection platform
(DNA endonuclease-targeted CRISPR trans reporter,
or DETECTR) for diagnostic uses (20, 26).
Although Cas12a exhibits low fidelity in discriminating
against ssDNA substrates (20), Cas14a
requires complementarity in the seed region for
ssDNA substrate recognition. This improved specificity
raised the possibility of using Cas14a for
high-fidelity detection of DNA single-nucleotide
polymorphisms (SNPs) without the constraint
of a PAM sequence. To test this idea, DNA substrates
were amplified using a phosphorothioate
(PT)-containing primer to protect one strand from
degradation by exonucleases. Upon addition of T7
exonuclease, the unmodified strand was degraded,
leaving ssDNA substrates that can be detected by
Cas14a (Fig. 4, C and D). As a proof of principle, we
aimed to detect the human HERC2 gene, which
contains a SNP responsible for eye color (27). We
amplified the HERC2 gene from DNA in human
saliva from both blue-eyed and brown-eyed individuals,
using the PT amplification approach
described above. When programmed with a guide
RNA targeting the blue-eyed SNP, Cas12a failed
to discriminate between the two ssDNA targets,
exhibiting robust trans activity in both cases,
whereas Cas14a exhibited strong activation
in recognition of the blue-eyed SNP with near-background
signal for the brown-eyed sample
(Fig. 4E). The development of Cas14-DETECTR
now allows for CRISPR-based detection of medically
and ecologically important ssDNA pathogens
as well as high-fidelity detection of SNPs
without the constraint of a PAM sequence.

Further investigation of compact type V systems
in metagenomic data revealed a large diversity of
systems that, like Cas14a to Cas14c, include a gene
encoding a short RuvC-containing protein adjacent
to acquisition-associated cas genes and a
CRISPR array. We found 20 additional such systems
in various uncultivated microbes that cluster
into five main families (Cas14d to Cas14h). Excluding
cas14g, which is related to cas12b, the cas14-like
genes form separate clades on the type V
effector phylogeny (fig. S12, A and B), suggesting
that these families evolved from independent
domestication events of TnpB, the transposase-associated
protein implicated as the evolutionary
ancestor of type V CRISPR effectors (3). Phylogenetic
reconstruction of their associated cas1
genes indicated that they too have different origins
for the cas14 subtypes (fig. S2). Altogether,
we identified 38 CRISPR-Cas14 systems belonging
to eight families (Cas14a to Cas14h) and eight
additional systems that could not be clustered
with our analysis (termed Cas14u) (data S3).

The small size of the Cas14 proteins described
here and their resemblance to type V effector 
proteins suggest that RNA-guided ssDNA cleav- 
age may have existed as an ancestral class 2 
CRISPR system (28, 29). In this scenario, a small, 
domesticated TnpB-like ssDNA interference com- 
plex may have gained additional domains over 
time, gradually improving dsDNA recognition 
and cleavage. Related to this hypothesis, smaller 
Cas9 orthologs exhibit weaker dsDNA-targeting 
activity than their larger counterparts but retain 
the ability to robustly cleave ssDNA (19). Aside 
from the evolutionary implications, the ability of 
Cas14 to specifically target ssDNA suggests a 
role in defense against ssDNA viruses or mobile 
genetic elements that propagate through ssDNA 
intermediates (30). A ssDNA-targeting CRISPR 
system would be particularly advantageous in 
certain ecosystems where ssDNA viruses consti- 
tute the vast majority of viral abundance (31). The 
unexpected finding that these miniature CRISPR 
proteins can conduct targeted DNA cleavage 
highlights the diversity of CRISPR systems hid- 
den in uncultivated organisms. Ongoing explora- 
tion of these underrepresented microbial lineages 
will likely continue to reveal new, unexpected 
insights into this microscopic arms race and lead 
to continued development of valuable CRISPR- 
based technologies.

An integral tool in biological research, protein expression
is increasingly important as a pharmaceutical production
mechanism for biological therapeutics. The ability to direct the 
ultimate nature of proteins depends largely on the degree of 
control researchers have over the final product. The combi- 
nation of old technologies—such as artificial enzymes—and 
new next-generation expression systems is revitalizing protein 
expression protocols. By Caitlin Smith 


The advent of genome editing tools such as the CRISPR/Cas9
system has given scientists an affordable,
accessible, and relatively simple method to alter 
genes. Yet precisely controlling protein expression 
through structural and posttranslational modifications takes on 
immense importance in the context of expressed therapeutic 
proteins, such as prescription medications. Controlling protein 
expression in other species, such as mosquitoes, may even 
aid researchers in eradicating malaria, the Zika and West Nile 
viruses, dengue fever, and other mosquito-borne illnesses. 

Our quickly evolving molecular tools have far-reaching 
effects, as evidenced by the wide range of CRISPR applications 
already being used. “CRISPR systems are amazing resources
for finding new tools to manipulate RNA and DNA,” says 
Jamie Cate, professor in the departments of molecular and 
cell biology and biochemistry at the University of California, 
Berkeley. “I see the present as the biological equivalent of the
transistor revolution for computers.” 

Other molecular tools are evolving alongside, and
sometimes interacting with, CRISPR systems to broaden the
scope and increase the potency and versatility of controls 
on protein expression. Artificial enzymes, for example, are 
shedding their limitations and gaining programmability. 
Molecular switches are becoming multifaceted, enabling 
greater fine-tuning. Next-generation expression systems can 
churn out greater yields of complex biological therapeutics 
than ever before. One thing is certain—as molecular controls 
become increasingly sophisticated, their myriad uses continue 
to expand.

Protein expression is
getting a boost from 
updated and recent 
technologies. 


Artificial enzymes for targeting DNA and RNA 

Although it's a powerful tool, even the CRISPR/Cas9 system has 
occasional drawbacks. One is the requirement for a protospacer
adjacent motif (PAM) sequence immediately next to the intended
editing site. The PAM sequence is a binding signal for the Cas9
enzyme, so its presence is necessary, but also limiting (if a PAM
sequence doesn't exist next to a desired editing site, one needs
to be created). Huimin Zhao, theme leader for biosystems design
at the Carl R. Woese Institute for Genomic Biology, University of 
Illinois at Urbana-Champaign, attempted to develop a CRISPR-like 
genome editing tool that would remove this constraint, and offer 
more versatility by permitting editing without a PAM sequence. 

Zhao’s lab developed an artificial restriction enzyme (ARE) 
system based on PfAgo, an Argonaute protein from the archaeon 
Pyrococcus furiosus, a prokaryotic (single-celled) microbe. This
platform surpasses traditional type II restriction enzymes— 
molecular scissors that cleave DNA at predefined sites (1).

“We can readily create an unlimited number of artificial 
restriction enzymes, with desired sequence specificity and 
defined sticky ends of varying lengths, using a simple system 
consisting of a single protein, PfAgo, and two DNA guides for 
targeting a specific dsDNA [double-stranded DNA] sequence,” 
says Zhao. “This system can be multiplexed, in that the same 
protein can be loaded with multiple DNA guides for targeting 
several sites simultaneously.” 

Because the DNA guides show PfAgo where to cut, the 
system can be programmed to cut virtually anywhere. Another 
advantage is that PfAgo has longer recognition sequences 
(typically 16 base pairs) than do traditional restriction enzymes 
(which typically recognize 4 to 8 base pairs); a longer recognition 
sequence makes it more likely to find a unique cleavage site. 
Unlike previous artificial restriction enzymes, PfAgo can generate 
defined and longer sticky ends when cutting DNA, which aid in 
subsequently attaching DNA fragments together. 
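
The point about recognition length can be made concrete with a back-of-the-envelope estimate: in a random sequence with equal base frequencies, a specific site of length n is expected about once every 4^n positions. The sketch below is an idealized model under that assumption (real genomes are not random, and palindromic sites read the same on both strands); the function name and the 4.6-Mb genome size are illustrative choices.

```python
def expected_sites(genome_bp: int, site_len: int, both_strands: bool = True) -> float:
    """Expected chance occurrences of one specific site in a random genome.

    Assumes independent positions with equal A/C/G/T frequencies. For a
    palindromic site, pass both_strands=False to avoid double counting.
    """
    if site_len < 1:
        raise ValueError("site_len must be at least 1")
    positions = max(genome_bp - site_len + 1, 0)
    strands = 2 if both_strands else 1
    return strands * positions / 4 ** site_len


if __name__ == "__main__":
    genome = 4_600_000  # hypothetical bacterial-scale genome, in base pairs
    for n in (4, 6, 8, 16):
        print(f"{n:2d}-bp site: ~{expected_sites(genome, n):,.3f} expected matches")
```

Under this model a 6-bp site recurs roughly every 4 kb, whereas a 16-bp site is expected far less than once in a genome of this size, which is why a longer recognition sequence is more likely to mark a unique cut site.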

Zhao’s lab is eager to apply its new technology. “We also 
developed a PfAgo/ARE-based direct cloning method to clone 
large, natural-product biosynthetic gene clusters for discovery 
of novel natural products that can potentially be used
as antibiotics and anticancer drugs,” says Zhao. His newly
formed company, Modular Bioscience, will explore PfAgo’s 
use in medical diagnostics, such as liquid biopsies, and single- 
nucleotide polymorphism and pathogen detection. 

Another CRISPR-associated Argonaute protein from the bacte- 
rium Marinitoga piezophila (MpAgo) is in the crosshairs of Cate’s 
lab, which focuses on protein-translation mechanisms and regu- 
lation (2). “Our goal was to make an RNA-targeting technology 
that we could use to explore RNA biology in human cells,” says 
Cate. They wanted to target RNA using an RNA-guided protein, 
as an alternative to labeling RNAs with covalent tags. 

Postdoctoral researcher Audrey Lapinaite, a member of Cate’s 
team, found the main challenge was “how to load the guide RNA 
into MpAgo inside cells,” says Cate. She solved this problem by 
assembling guide RNA-protein complexes (RNPs) in vitro, then 
using the RNPs in cell-based experiments. Lapinaite found that 
a modification of the 5'-nucleotide of the guide RNA resulted in 
easily programmable RNPs with high affinity to their fully comple- 
mentary RNA targets. Furthermore, the modified RNPs show 
high specificity, and can discriminate between RNA substrates 
differing by only one nucleotide. 

Cate is excited to explore the prospects for manipulating RNAs 
using MpAgo RNPs. “We are most eager to use MpAgo for RNA 
targeting in human cells to explore human RNA biology, such 
as in imaging and proteomics experiments,” he says. Because 
MpAgo is derived from bacteria, its use for therapeutic purposes 
is unlikely—but Cate hopes to apply this system to gain insight 
into how endogenous human Argonautes work. 


Molecular switches 

Given the enormous capabilities of CRISPR/Cas9 genome
editing, the search is on for ways to make the system induc- 
ible, so that editing can be turned on or off at specific times. A 
United Kingdom-based collaboration between Yu-Hsuan Tsai 
from Cardiff University and Anthony Perry from the University of 
Bath reveals a new type of inducible CRISPR switch (3). Previous 
switches had drawbacks, such as leakiness of editing activity in 
the absence of signal, or a reliance on antibiotic use, which can 
increase the risk of antibiotic resistance. “We envisaged the use 
of an artificial, nonphysiological amino acid [that] would address 
these problems,” says Tsai. 

Tsai and Perry’s groups used genetic code expansion to make 
a genome sensitive to an artificial signal. In cell lines and mouse 
embryos, researchers “expanded” the genetic code by inserting 
a toolkit that makes the expression of Cas9 (the enzyme required 
for genome editing) dependent on the presence of the lysine
derivative Lys(Boc). This non-natural amino acid is ideal for this
purpose because it is cheap, safe, readily obtainable, and easy to 
administer to cells or whole animals. 

The researchers showed that the presence of Lys(Boc) resulted
in genomic editing, while in its absence no genomic editing oc- 
curred. The success of Tsai and Perry's switch may be due in part 
to the difference in strategies: “Our approach controls the trans- 
lation of the functional Cas9 protein, whereas previous methods 
[use] posttranslational control, such as modulating the activity of 
translated protein by different stimuli,” says Tsai. 

Future applications of their work include gene drives, which 
virtually ensure that all progeny will inherit genetic changes that 
rapidly spread throughout a population of animals (in herds of 
livestock, for instance). Because the effect is inducible, its power 
can be more safely controlled. This new, inducible CRISPR switch
may be more suitable “in clinical therapeutic genome editing in 
situ, or gene drives in which environmentally compatible control 
is paramount,” says Tsai. 

Another type of molecular switch features an artificial enzyme, 
and was developed by a Swiss collaboration headed by Tom Ward, 
director of NCCR (National Centre of Competence in Research) 
Molecular Systems Engineering, a multidisciplinary initiative 
comprising researchers at the University of Basel and ETH Zurich 
(Swiss Federal Institute of Technology). Ward's lab took a modular 
design approach, combining their knowledge of artificial metal- 
loenzymes with expertise in cell-penetrating modules from Stefan 
Matile’s lab in the Department of Organic Chemistry at the Univer- 
sity of Geneva, and a synthetic gene-switch module produced by 
Martin Fussenegger’s lab in the Department of Biosystems Science 
and Engineering at ETH Zurich. 

This new cell-penetrating disulfide module lets an artificial 
metalloenzyme enter a cell without harming it, “reminiscent of a 
Trojan horse,” says Ward. Upon entry, the enzyme catalyzes a reac- 
tion that uncages a hormone. The synthetic gene-switch module 
detects the newly uncaged hormone and responds by turning on 
expression of the luminescent reporter luciferase. “The gene switch
is thus turned on by an abiotic reaction operated by an artificial 
enzyme,” says Ward. 

Now that the collaboration has established proof-of-concept for 
their system (4), they are actively pursuing biomedical applications 
that include interacting with a host protein—for example, a protein 
expressed by cancer cells that enables precise targeting. “Carbonic
anhydrase is overexpressed on the surface of many cancer
cell lines,” says Ward. “We could use this protein to accumulate the 
artificial metalloenzyme either on the surface or inside of cancer 
cells; the system is turned on only when the artificial metalloen- 
zyme binds to carbonic anhydrase.” 


Next-generation expression systems 

Advances in molecular control of gene expression are also 
expanding the opportunities for protein expression systems. 
As drug manufacturers look for cheaper ways to express new 
biologic drugs, next-gen expression systems are gaining traction. 
Currently, the standard expression system is mammalian Chinese
hamster ovary (CHO) cell lines. However, the limitations of CHO
cells—such as high cost and comparatively slow doubling times— 
make the next-gen systems worth considering for expressing 
new biopharmaceuticals, especially to help reduce the costs of 
important medicines. 

For example, the SoluPro protein expression platform from 
AbSci can generate soluble, properly folded proteins at extremely 
high titers, currently 4 g/L of full-length antibody and >20 g/L
of other complex products, using Escherichia coli expression. 
Typically, some human proteins can’t be expressed in E. coli, 
and require mammalian cell lines for proper folding. AbSci’s 
technology includes two innovations that make it possible to 
produce such proteins in E. coli, while taking advantage of the 
simplicity and lower costs of this expression system. 

One innovation is a semioxidized cytoplasm that produces
soluble, disulfide-linked proteins. “Cytoplasmic production, which 
traditionally is limited by inclusion body formation, is desirable 
because it has significantly higher capacity than the periplasm, 
places no restrictions on protein size, and achieves dramatically 
shorter production cycles of one to two days, compared to 
secretion-based expression systems,” says Sean McClain, AbSci's 
founder and CEO. The second innovation is SoluPro’s dual 
inducible promoters, which can be independently controlled. This 
enables “tuning” of protein production rates by optimizing for the 
best protein-folding and titer. 

“The SoluPro system’s ability to properly fold proteins 
overcomes a large majority of the limitations found with 
traditional E. coli expression,” says McClain. “We have successfully
produced novel antibody scaffolds as well as IgG1 and IgG4
molecules where effector function is not desirable.” Many of these 
novel antibody scaffolds, which are gaining increasing traction 
in development pipelines, are challenging to produce in CHO 
cells. “AbSci is keenly focused on these hard-to-produce next- 
generation antibody scaffolds, including bispecifics, Fc [fragment 
crystallizable]-fusion proteins, and other multispecific products, 
which SoluPro is ideally suited to manufacture efficiently,” he says. 

Dyadic offers another next-gen expression platform, the 
fungal-based C1 gene expression system. Dyadic’s scientists took 
advantage of a serendipitous mutation that resulted in a several-hundred-fold
increase in protein productivity of the filamentous
fungus Myceliophthora thermophila (which the company
nicknamed “C1”), then created a molecular toolset to turn the
fungus into a recombinant expression host. They are currently
focused on biomedical efforts, applying the C1 expression
platform to making biologic vaccines and drugs more affordable
and accessible. Though filamentous fungi may sound exotic,
in nature they are natural secretors (C1 has a 2-hour doubling
time, compared to about 20 hours for CHO cells). They also use
defined synthetic media, which is cheaper and avoids the need
for viral inactivation required for CHO cells.
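
To see what the doubling times quoted above imply, the sketch below computes the fold expansion of a culture under idealized, unrestricted exponential growth. That growth model is an assumption (real cultures hit nutrient and density limits), and the function name is chosen only for illustration.

```python
def fold_expansion(hours: float, doubling_time_h: float) -> float:
    """Fold increase in cell number after `hours` of unrestricted exponential growth."""
    if doubling_time_h <= 0:
        raise ValueError("doubling time must be positive")
    return 2.0 ** (hours / doubling_time_h)


if __name__ == "__main__":
    # Doubling times as quoted in the text: about 2 h for C1, about 20 h for CHO.
    for name, td in [("C1", 2.0), ("CHO", 20.0)]:
        print(f"{name}: {fold_expansion(20.0, td):,.0f}-fold in 20 hours")
```

In the time a CHO culture doubles once, a culture with a 2-hour doubling time would go through ten doublings in this model, which is the basis for the shorter production cycles claimed for faster-growing hosts.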

Today, biosimilars are being produced ineffectively by cell lines 
such as CHO cells, according to Dyadic’s CEO Mark Emalfarb. 
However, says Emalfarb, “We can produce up to 2 to 5 times more, 
in a third of the time, and at a fraction of CHO media costs [as 
compared to] the reported average industry CHO productivity.” 
And because C1 secretes its product into the media, the down- 
stream protein harvesting steps are also simpler than those in oth- 
er systems—for example, E. coli requires lysing of cells and purify- 
ing product from cell fractions, adding more complexity and cost. 

Dyadic's C1 platform efficiently produces full-length monoclonal 
antibodies, antibody heavy and light chains, Fc-fusion proteins, 
Fabs (antigen-binding fragments), bispecific antibodies, and vac- 
cines. “We can also make VLPs [virus-like particles], which are theo- 
retically more potent types of vaccines, and are more difficult to 
express in general,” says Emalfarb. “We even have a secreted VLP 
now, so less is lost in downstream processing.” 

Dyadic is also engineering C1 to produce human-like 
glycosylation in different forms, to allow pharmaceutical 
companies to evaluate them. “Unlike CHO cells, C1 cells are
monoclonal cells,” explains Matthew Jones, Dyadic’s chief 
commercial officer. “C1 has the potential to produce more 
consistent, more homogeneous glycostructures for companies 
to evaluate, to test which ones may work better.” A recently 
announced collaboration with the biopharmaceutical company 
Sanofi-Aventis Deutschland GmbH will study the use of the C1 
technology to express different types of therapeutic compounds, 
such as vaccines and protein-based biologics. 

Both E. coli- and fungal-based expression systems share other
advantages, such as not requiring the expensive viral clearance 
steps needed in CHO cells. In addition, the lower costs of next-gen 
expression systems can in turn lower drug prices, while allowing 
drug manufacturers to maintain a profit margin—which incentivizes 
manufacturers to produce drugs with lower prices and make them 
available to the public. Both McClain and Emalfarb hope their 
companies’ technologies will encourage manufacturers to market 
drugs that are good for society but not otherwise profitable, such 
as new influenza vaccines that cost less but work better. 

None of this would be possible without the genetic tools that 
harness expression systems. As scientists achieve finer controls 
over genomes, the number of expression products will soar. For 
example, next-gen expression systems may see further record 
expression levels with the incorporation of genome editing 
tools like CRISPR/Cas9. As these tools continue to grow in 
sophistication, the benefits to patients will continue to increase— 
and may transform biomedicine in the process.

LIFE SCIENCE TECHNOLOGIES


new products: tissue/cell culture 


Cloning Kit 

Gibson Assembly Cloning allows 
for the insertion of one or more 
DNA fragments into virtually 
any location of any vector 
without the need for compatible 
restriction sites. In a single 
round of cloning, join multiple DNA fragments (1-15) to create seamless 
constructs without subcloning. With easy reaction setup and less than 
5 min of hands-on time, go from DNA fragments to transformation- 
ready constructs in an hour or less. Gibson Assembly Cloning achieves 
cloning efficiencies of up to 95%. Three kits are available: Gibson 
Assembly HiFi 1-Step Kit, Gibson Assembly Ultra Kit, and Gibson 
Assembly Site-Directed Mutagenesis Kit. 

SGI-DNA 

For info: 855-474-4362 

www.sgidna.com/reagents.html 


Minicircle Services 

When you want sustained transgene expression without the risks of 
introducing exogenous DNA—such as for developing animal models or 
pioneering new gene-therapy approaches—minicircle technology is a 
great option. But many labs don't have either the time or the in-house 
expertise to spend constructing and producing minicircles—which is 
why we offer Minicircle Cloning & Production Services. Delivered by our 
experienced scientific team, who routinely produce minicircles for large 
pharma partners, our services are very customizable. We can provide 
a complete end-to-end experience—we clone your complementary 
DNA (cDNA), microRNA (miRNA), or small hairpin RNA (shRNA) into one 
of our highly regarded minicircle parental vectors and assemble large 
quantities of minicircles—or only parts of the cloning and production 
process. We can even customize the parental vector for you. 

System Biosciences 

For info: 888-266-5066 

www.systembio.com 


cDNA Clones 

TrueORF Gold is OriGene’s collection of premium complementary DNA 
(cDNA) clones that have passed the ultimate tests: sequence verifica- 
tion and protein expression validation. Each TrueORF Gold clone has 
been used to transfect human cell lines for protein expression. In 
fact, OriGene has produced over 12,000 overexpression lysates from 
TrueORF-transfected human embryonic kidney 293 (HEK293) cells and 
subsequently purified over 5,000 human proteins from those cells. 
Each clone is fully sequenced with downloadable chromatogram files 
posted online. View and analyze the sequences before purchasing, 
and you will never worry about mutations when using TrueORF Gold. 
All clones are available for next-day shipment. Why spend weeks 
constructing a cDNA clone or having it synthesized? TrueORF Gold 
accelerates your research, experiment, degree, publication, or grant 
application, and gets you there faster. There is no need for subcloning 
or plasmid preparation. Order a TrueORF Gold clone today, and start 
your experiment tomorrow. 

OriGene Technologies 

For info: 888-267-4436 

www.origene.com 
HDR Plasmid Donor Kits 

The Edit-R HDR Plasmid Donor Kits enable you to easily create a 
plasmid donor for insertion of either the mKate2 or enhanced green 
fluorescent protein (EGFP) fluorescence reporters, or a custom 
gene sequence of your choice, into almost any genomic location. 
Gene engineering using CRISPR/Cas9 relies on two pathways for 
repair of double-strand breaks in DNA: nonhomologous end joining 
(NHEJ) and homology-directed repair (HDR). Precise repair of DNA 
in order to generate a specific mutation or insert or remove a 
desired sequence requires the HDR pathway, and depends on the 
availability of a DNA donor template that can be used to repair the 
DNA break. The kits enable fast, efficient assembly of a DNA donor 
plasmid, and contain validated protocols for design, assembly, and 
confirmation of the plasmid. 

Dharmacon 

For info: 800-235-9880 

dharmacon.horizondiscovery.com 


Cell-Free Linear DNA Expression Kit 

Arbor Biosciences’ myTXTL Linear DNA Expression Kit allows for 
the use of linear DNA, including PCR products, gene fragments, and 
synthesized DNA, as input for transcription (TX) and translation 
(TL) in an Escherichia coli-based cell-free platform. Removing the 
requirements for cloning genes into plasmids, transforming cells, and 
further selecting for clones will greatly accelerate the design-build- 
test cycle for synthetic biology research and sampling in protein- 
screening applications. Created for use in pilot-scale studies as well 
as high-throughput analysis on automated liquid-handling platforms, 
myTXTL Linear DNA Expression Kit delivers a complete solution for 
protein research and analysis. The E. coli-based system is a ready-to- 
use master mix containing all the critical components required for 
cell-free protein production. End-users simply need to add their linear 
DNA to the master mix and incubate for as little as 30 min before 
analyzing the expressed product. 

Arbor Biosciences 

For info: 734-998-0751 

www.arborbiosci.com 


Human Exosomes 

ZenBio offers fully characterized exosomes from placental 
mesenchymal stem cells, cord serum, and custom isolation. 
Exosomes are membrane vesicles stored intracellularly in 
endosomal compartments, and are secreted when these struc- 
tures fuse with a cell’s plasma membrane. They contain protein, 
DNA, and RNA, making them an attractive vector for paracrine 
signals delivered by stem cells. They may also be “loaded” with 
predetermined proteins and nucleic acid to achieve a desired effect, 
and can be stored as an off-the-shelf product with the potential to 
circumvent many of the limitations of viable cells for therapeutic 
applications in regenerative medicine. In vitro, exosomes from 
preadipocytes stimulate cell proliferation in a wound-healing 
model. In vivo, adipose-graft-derived exosomes have been 
shown to be a promising tool for skin repair and remodeling. 

ZenBio 

For info: 919-547-0692 
www.zen-bio.com/products/cells/human-exosomes.php 
Faculty Position in Medicinal Chemistry at The 
University of Kansas 


We seek to fill a tenure track faculty position at the 
Assistant Professor level to commence by August 2019. 
Consideration will be given to applicants experienced 
in medicinal chemistry, chemical biology, natural 
products chemistry, organic chemistry, biochemistry 
or a closely related field. See the detailed position description 
and application instructions at https://employment.ku.edu/faculty/13139BR. 
Application review begins December 5, 2018 and will continue as 
needed to fill the position. KU is an Equal Opportunity / 
Affirmative Action Employer; full policy at 
http://policy.ku.edu/IOA/nondiscrimination. 
Laureates of the 2017 Scientific Prizes * 


The Fondation Bettencourt Schueller: 
Helping scientists spread their wings 


The Fondation Bettencourt Schueller is a French philanthropic organization 
founded in 1987 by Liliane Bettencourt, her husband, André Bettencourt, and 
her daughter, Françoise Bettencourt Meyers. The mission of the foundation is 
to support excellence and innovation in biomedical research and the arts, and to 
promote an inclusive society, all encapsulated in their motto, “Give talent wings.” 


Sponsored by 

Fondation Bettencourt Schueller 
Recognized as a public-interest foundation since 1987 


“You have to understand that 
this is a family foundation,” 
says Hugues de Thé, president 
of the foundation's scientific 
advisory board and professor 
of oncology at the Collège de 
France in Paris. “And for the 
past 30 years the family has 
infused the foundation with 
its values.” The foundation 
esteems “passion and a 
vision of disruptive and risky 
projects, creativity, liberty 
and innovation through hard work, and internationally 
competitive excellence,” says de Thé. 

To achieve its mission, the foundation awards prizes and 
grants to individuals who embody these values. Applications 
and nominations for life sciences prizes and other funding 
requests are stringently evaluated by an advisory board of 
13 accomplished scientists, and by external review. 

A team of 20 foundation members works closely with 
the advisory board to identify important fields of 
endeavor in need of funding, and supports awardees 
and project developers, providing resources such as 
advice on business models, governance choices and 
best practices, networking opportunities, and techni- 
cal support. 


Hugues de Thé 


Funding the future 
The foundation is best known in the scientific community for the 
annual Liliane Bettencourt Prize for Life Sciences, a prestigious 
award whose recipients include May-Britt Moser and Edvard I. 
Moser in 2006, winners of the 2014 Nobel Prize in Physiology or 
Medicine, and Benjamin Lehner in 2016, winner of the 2016 Euro- 
pean Molecular Biology Organization (EMBO) Gold Medal. This 
€ 300,000 (USD 350,000) prize is awarded to a researcher aged 
45 years or younger; in odd-numbered years, the winner is cho- 
sen from researchers who are based in France, and in even- 
numbered years from those in Europe outside France. 

The foundation awards three other life sciences prizes: 
(1) The Bettencourt Prize Coups d’élan pour la recherche 
française, which aims to improve infrastructure and working condi- 
tions for biomedical researchers by funding renovation, reorga- 
nization, acquisition of equipment and materials, and operational 
assistance. This € 250,000 (USD 290,000) prize is awarded 
annually to four research teams at the French National Institute 
of Health and Medical Research (Inserm) or at the Institute of Bio- 
logical Sciences (INSB) of the National Center for Scientific Re- 
search (CNRS) in France; (2) The Young Researchers Bettencourt 
Prize, a € 25,000 (USD 29,000) prize awarded annually to 14 top 
early-career French Ph.D.s or M.D./Ph.D.s to enable them to do a 
postdoctoral internship abroad; and (3) The ATIP-Avenir grant, a 
€ 300,000 (USD 350,000) award in partnership with Inserm and 
CNRS, given to researchers with outstanding projects who wish to 
create their own team and return or move to France. 
In addition to these prizes, 
the foundation evaluates a 
continuous stream of requests 
and makes numerous grants in 
support of life sciences research. 
One example is an M.D./Ph.D. 
program established in France 
15 years ago by the foundation 
in partnership with Inserm; this 
year, the program will include 
funding to allow assistant 
professors in medicine with a 
Ph.D. to have some protected time for research, away from 
hospital duties. Other examples include helping establish the 
Imagine Institute, the Brain and Spine Institute (ICM), and the 
Center for Interdisciplinary Research (CRI) in France; funding a 
system to digitize lectures at the Collège de France and make 
the videos publicly available; and promoting science education 
for high school students and the general public. 

“The foundation listens to what are the emerging themes 
and areas that should be better explored, and that is very 
special—to have a foundation that is right there with the scien- 
tists, putting their finger on the pulse of what's important to 
do,” says Edith Heard, professor at the Collège de France and 
unit director at the Institut Curie, in Paris; future director gen- 
eral of the European Molecular Biology Laboratory (EMBL) 
starting in January 2019; and member of the foundation's 
scientific advisory board. “They are unique because they have 
flexibility in terms of funding that one doesn't find elsewhere. 
The foundation is extremely professional in the way they do 
this and yet you feel part of a family, that you can connect 
with them.” 

As part of its humanitarian mission to promote an inclusive 
society, the foundation funds innovators who are working to 
create opportunities for people with disabilities, such as autism 
and hearing loss, or who are homeless or mentally ill. In 2016, 
the foundation established the Fondation Pour l’Audition, 
which supports research, treatment, and prevention of hearing 
loss with the goal of giving people with hearing issues equal 
access to educational, professional, and social opportunities. 
“They really care about helping society,” says Heard. “That is 
something that comes across quite strongly in all of the actions 
I see coming out of the foundation.” 


Making a difference 
one researcher at 
a time 

An awardee who represents 
the creativity and leadership 
valued by the foundation is 
Cédric Blanpain, professor 
at the Université Libre de 
Bruxelles, Belgium, and 
WELBIO (Walloon Excellence 
in Lifesciences and 
Biotechnology) investigator. 
Blanpain was the 2012 
recipient of the Liliane 
Bettencourt Prize for Life 
Sciences for his discoveries 
in stem cell and cancer 
cell biology, which were 
made using novel methods 
developed by his lab. On the 
impact of winning the prize 
on his research, Blanpain 
says, “Of course I was very 
happy to get this award. It 
increases your visibility and 
helps you recruit better 
employees.” 

Blanpain’s group dis- 
covered that killing a minor 
population of cancer stem 
cells via lineage ablation can 
lead to tumor regression. 
Describing this technique, he says, "The way we use lineage 
ablation—there are many different ways—is to express the 
receptor of an important toxin in the cancer stem cell population. 
If you don't express its receptor, the toxin is inert. But when 
the receptor is present and we inject the toxin, we wipe out 
the population of cancer stem cells carrying the receptor, and 
the tumor shrinks.” His group also identified a population of 
tumor stem cells, with characteristics of both mesenchymal and 
epithelial cells, that can give rise to metastasis. 

In describing his research approach, Blanpain says, "My 
scientific background has been influenced by my medical 
background, so much of my research has some kind of 
health-related objective. What I try to do in my lab is to use 
model systems to observe new paradigms in cancer cell and 
stem cell biology. We then use all the possible technologies 
offered to us—transcriptome assays, chromatin epigenetic 
characterization, and others—to understand the underlying 
molecular mechanisms. Finally, we attempt to apply what we 
have learned from this basic model to improving human health 
and adding to medical knowledge.” 

Awards are long-term investments for the foundation. “I 
know that former awardees may ask for additional support 
if they have new original and breakthrough projects,” says 
Blanpain. 

Heard echoes this sentiment. “They take great care of the 
people they connect with,” she says. “Overall, it is a very 
interesting model for how to keep investing in the most 
promising research, and to get a feeling of what is needed in 
specific areas. It’s also an example of how a foundation can 
really help science work, and hopefully it will lead to others 
adopting the model.” 


Cédric Blanpain 


The Yangtze River Delta: 


Growing and Prospering by Relying on Integration and Talent 


The Yangtze River Delta has boasted the 
densest population, the most developed 
economy, the most thriving culture, and 
the highest living standard in China since 
ancient times. Because of its exceptionally advan- 
tageous location and abundant natural resources, 
the area has formed two strategic hubs that link it 
to oceans and connect it with inland areas. 
The total economic aggregate of the delta’s three 
provinces (Jiangsu, Zhejiang, and Anhui) and one 
municipality (Shanghai) reached RMB 19.5 trillion 
(equivalent to USD 2.95 trillion as per the exchange 
rate at the end of 2017) for 2017, accounting for 
23.6% of China’s aggregate. The economic aggre- 
gate of the Yangtze River Delta Urban Agglomer- 
ation (YRDUA), which includes 26 cities, reached 
RMB 16.5 trillion (USD 2.57 trillion) for 2017, 
ranking it as the fifth of the top six urban agglom- 
erations worldwide. The per capita gross domestic 
product (GDP) of the area for 2017 reached RMB 
87,400 (USD 13,600), equaling the primary level of 
developed countries. The per capita GDP of Shang- 
hai, Jiangsu, and Zhejiang for 2017 reached RMB 
124,600, RMB 107,200 and RMB 92,100 (USD 
19,390, USD 16,680, and USD 14,330) respective- 
ly, nearly reaching the median level of developed 
countries as per the USD averages. 
From the perspective of both economic aggregate 
and per-capita GDP, the YRDUA has ranked as 
an urban agglomeration with one of the largest 
economic aggregates and the highest economic 
vitality worldwide—and it is expected that these 
conditions will continue to improve. 


Yangtze’s Plan to 
Integrate and Develop 


The three-year action plan for the integrative devel- 
opment of the Yangtze River Delta was published 
in 2018, further focusing on key areas such as traf- 
fic interconnection, mutual aid and protection for 
energy development, collaborative innovation in 
industries, the building of high-speed information 
highways, joint disaster prevention and environ- 
mental control, extensive and convenient public 
services, and the orderly opening of commercial 
markets. The plan’s objectives are to establish the 
framework for a world-class urban agglomeration; 
build a primary facility system with relevant hubs, 
functions, and networks; and lay the foundation 
for innovation-driven systems for regional indus- 
try and collaborative development by 2020. When 
these goals are fully realized, the area will become 
a major conduit for bringing global resources to the 
Asia-Pacific region as well as a world-class urban 
center with global influence. 

The high-quality integration of the Yangtze River 
Delta will no doubt reshape the layout of China’s 
economic geography—and possibly that of the 
world. 

The delta has already seen unprecedented changes 
even in the early stages of integration. First of all, 
in terms of adjusting administrative structures, 
the YRDUA has been guiding each city involved 
in forming its own unique functions by directing 
them to eliminate existing bureaucratic config- 
urations. Secondly, the area has implemented 
flexible coordination mechanisms at higher levels, established 
cost-sharing initiatives in specific project operations, and 
driven cooperation between different regions via “win-win” 
measures instead of administrative directives. More impor- 
tantly, when high-speed growth was pursued, the delta used 
to focus on energy issues and traditional production factors 
such as labor force allocation; but it is now focused on stimu- 
lating innovation, fostering cooperation between science and 
technology industries and talent, and creating new industry 
highlands in order to play a more important role in reshaping 
the global economy. 

In addition, the YRDUA is ensuring that required resources 
are in place to increase connectivity within the delta and to 
build coordinated development demonstration areas. The 
establishment of regional intercity railway network planning 
for co-construction of infrastructures; the creation of a world- 
class urban agglomeration with the fastest network connec- 
tion speed through the installation of 5G networks; and the 
co-construction of the G60 Science & Technology Innovation 
Valley will be important approaches to securing high-quality 
integration in the region. 


Seeking Talent to Finish the Picture 


This “grand scroll” painting of Yangtze River Delta construc- 
tion is being completed gradually, waiting for more skilled 
painters to add their master strokes. 

Numerous high-level experts will act as these painters— 
and they will have a rich and varied “canvas” on which to 
work. For historical reasons and because of their differing levels of 
development, the delta’s administrative divisions have diverse 
features: Shanghai boasts a highly advanced and innovative 
science and technology market and a strong educational sys- 
tem; Jiangsu has a solid economic foundation and a powerful 
manufacturing base; Zhejiang is famous for its wonderful 
market vitality and creative business models; and Anhui is 
eager to embrace new technology. However, all these areas 
have something in common: They are all in urgent need of 
high-level talent. Even though the Yangtze River Delta holds 
one of the largest concentrations of intellectual capital in Chi- 
na, the concerns of its administrative divisions about bringing 
in more talent are still quite obvious. 

With regard to talent acquisition, the methods of Hangzhou and 
Ningbo, two Zhejiang-based cities famous for commerce, are a 
good example of the intense level of activity that is taking place. 
As two subprovincial cities in Zhejiang, Hangzhou and Ning- 
bo are recognized for their rapid and steady economic growth. 
In 2017, the GDP of these two cities reached RMB 1.13 trillion 
(USD 175.83 billion) and RMB 984.69 billion (USD 153.23 
billion), respectively, ranking first and second in the province. 


Hangzhou: Becoming a Talent Paradise 


Even though it is a city rich in history and culture, and once 
named by Marco Polo as “a paradise on earth,” Hangzhou 
hasn’t been constrained by its traditions. On the contrary, like 
other Chinese cities, it has experienced its fastest economic 
growth in a decade. Its GDP has been No. 1 in all of 
Zhejiang for several years. As a city with an economic aggregate 
exceeding RMB 1.13 trillion, Hangzhou is one of the few 
Chinese cities maintaining an annual economic growth of 
more than 8%. As a result, the city has instituted more stringent 
requirements for acquiring talent and is now bringing in 
specialists from overseas. In recent years, it has attracted 
29,000 high-level experts who had studied abroad and 15,000 foreign 
experts; foreigners have registered more than 4,980 new enterprises 
(acting as legal persons); and in 2017 the city’s net 
inflow of both domestic and international talent ranked first 
nationwide. 

In February 2018, Hangzhou issued opinions on how to accel- 
erate the internationalization of local talent as well as policies 
for acquiring overseas talent, such as providing a maximum 
of RMB 100 million (USD 143,500) as subsidies for foreign 
talent who start their business in the city, further intensifying 
the process. It’s worth noting that Hangzhou has realized the 
importance of universities and research institutes for attract- 
ing top talent. Besides universities like Zhejiang University, 
which consolidated its strength after merging with other 
institutions in 1998 and became one of China’s top schools, 
and Westlake University, which aroused fierce debates after 
its establishment and hopes to become a pioneer in China’s 
higher education system, Hangzhou plans to set up more in- 
fluential universities and scientific research institutes at home 
and abroad. 


Ningbo: On the Fast Track for Talent 


In terms of introducing universities, Ningbo shares Hang- 
zhou’s enthusiasm. It is expected that at least five universities 
will settle in Ningbo, including distinguished schools such as 
Zhejiang University, Beihang University, and the University 
of the Chinese Academy of Sciences (UCAS). The introduc- 
tion of prestigious universities will not only bring concrete 
economic benefits to Ningbo but also expand its channels for 
introducing talent. 

As Zhejiang’s second-largest city, Ningbo is facing increasingly 
fierce regional competition for talent. In the hope of 
being one of the top talent reservoirs among China’s first-tier 
large cities, Ningbo issued four comprehensive talent policies 
within 10 months in 2017. In December 2017, it issued an an- 
nouncement on using private capital to introduce high-end en- 
trepreneurship teams, which it rated as the “most significant 
and striking” talent policy implemented by the city. It later 
announced the introduction of urgently needed high-level tal- 
ent on the first work day of 2018, and subsequently issued the 
details about requirements for residence migration in order 
to attract talent and entrepreneurship teams, which included 
lowering the threshold for settling in Ningbo and providing 
grants for projects, allowances for purchasing houses, and 
subsidies for settling down. In September 2018, the city pub- 
lished what it calls the “1+X” measures for developing a “talent 
ecosystem,” which would involve gathering top talent from 
expanding industries and forming a structure for “introducing, 
training, utilizing and retaining” them, while providing them 
with more support and more thorough guidance. 

With an improved integration process, the Yangtze River Delta’s 
economic dreams will definitely become a reality, as long as 
the acquisition of talent keeps pace with urban development. 


We welcome domestic and foreign scholars to contact us. 
We will provide you with free, one-on-one personal service, 
inform you about Yangtze River Delta region colleges and 
universities, and help you to learn about and apply for talent 
recruitment projects in this region. For assistance, please con- 
tact the personnel consultant at consultant@acabridge.edu.cn. 
For more details, please visit our website at www.edu.cn/jjcsj. 
SEU, Shaping China’s Research and Innovation Landscape 
“What the Great Learning teaches, is to illustrate illustrious virtue, to renovate the 
people, and to rest in the highest excellence.” 


Located in the historically and culturally 
significant city of Nanjing, Southeast Uni- 
versity (SEU) is one of the oldest institutions 
of higher learning in China. Its constant 
pursuit of the highest excellence has won it 
the accolade of “a sacred place of learning.” 
Liuchao Juniper, the sacred and iconic tree 
on the flagship campus, weathered for over 
1500 years, is a witness to its unremitting 
effort for perfection in the past 116 years. 

The history of Southeast University can 
be traced back to 1902, when it was founded 
as Sanjiang Normal School. After that, it 
evolved successively into Liangjiang Nor- 
mal School, Nanjing Higher Normal School, 
National Southeast University and National 
Central University, until it was renamed 
Southeast University in 1988. The motto 
-- striving for perfection--is interpreted as 
“self-perfection, care, perfection, preemi- 
nence”. Upholding the motto, Southeast Uni- 
versity has made substantial progress over 
the years, which fully embodies its pursuit of 
the highest excellence. 

SEU is the place where the Science 
Society of China was established, thus the 
“birthplace of China’s natural sciences”. 
And it is at Mei’an-- the northwest corner 
of the campus-- that Xueheng Intellectual 
Group was founded, which was a milestone 
in the continuation of the Chinese traditional 
culture. SEU was the first co-educational 
university in China and the earliest adopter 
of the credit-based education system. It 
attracted a galaxy of elite returned scholars 
from overseas. Its tradition of excellence and 
pioneering work continue to echo down the 
decades. 

As China’s leading research-oriented 
university, SEU has developed China’s first 
robot and is the first Chinese university to 
have set up an undergraduate robotics program 
and a molecular bioelectronics laboratory. She 
is the cradle of China's modern architecture, 
and the most important information science 
and industrial innovation powerhouse. She 
started China’s art education, and has since 
occupied a dominant position in the research 
of art theory. 

SEU is one of only three universities 
in China that have continued to run a 
Special Class for the Gifted Young, and 
one of the pilot universities with a series of 
reforms in the training mode aimed at devel- 
oping top and innovative talents. It has fos- 
tered over 200 academicians of the Chinese 
Academy of Sciences and Chinese Academy 
of Engineering and nurtured over 330,000 
outstanding graduates. 

SEU is one of the few key universities 
to have been supported by the Chinese 
government’s “211 Project” and “985 Project,” 
and it is now one of the “Double First-class 
Initiative” universities, with 11 disciplines 
selected, ranking 8th among all Chinese 
universities. SEU offers 76 undergraduate 
programs and 33 first-level Ph.D. programs. 
Five disciplines, namely architecture, civil 
engineering, transportation engineering, 
biomedical engineering, and art theory, rank 
No. 1 in China; 11 disciplines, including 
engineering, computer science, materials 
science, mathematics, physics, chemistry, 
clinical medicine, biology and biochemistry, 
pharmacology and toxicology, neuroscience 
and behavior, and social sciences, are 
in the ESI world top 1%, with engineering 
and computer science in the top one 
thousandth. 

SEU values the coordinated, integrated 
development of engineering, science, liberal 
arts and medical science. Powered by its 3 
key state laboratories, 3 national engineer- 
ing research centers, 2 national engineering 
technology research centers and other lead- 
ing research platforms, SEU pushes for the 
world's scientific and technological frontiers 
and sticks to major national demands, com- 
mitted to major strategic high-tech research, 
fundamental and applied basic research and 
cross-disciplinary integration. SEU continues 
to make breakthroughs in emerging, 
cutting-edge, and interdisciplinary fields such 
as quantum information, artificial intelligence, 
cyber security, intelligent manufacturing, 
smart cities, intelligent transportation, 
brain science, and biomedical big data. SEU 
has been involved in and made major contributions 
to projects such as the Three Gorges 
Project on the Yangtze River, construction of the 
Five-hundred-meter Aperture Spherical Radio Telescope, 
5G mobile communication 
technology development, Antarctic scientific 
investigation, the Manned Space Project, the 
high-speed railway network, and the Hong 
Kong-Zhuhai-Macao Bridge. 

Boasting internationalization, SEU is 
a university of significant global impact. 
Scholars from all over the world come here 
to engage in teaching, research and exchange 
programs. Top students from all countries 
are attracted to the university in pursuit of 
their dreams. 

Carrying on its century-old tradition while 
integrating into world development, SEU established 
the AMS (Alpha Magnetic Spectrometer) Experimental 
Center jointly with Nobel laureate Prof. Samuel Ting 
and the SEU Shing-Tung Yau Center with Prof. 
Shing-Tung Yau, and set up the Architecture 
Internationalization Demonstration School. SEU also initiated 
and established the UK-China University 
Consortium on Engineering Education and 
Research (UKCCEER), aiming to promote 
and facilitate strategic engagement and 
bilateral cooperation in transnational joint 
programs on postgraduate higher education 
and engineering research between the UK 
Partners and the China Partners, in the areas 
of energy and intelligent manufacturing in 
support of both countries’ manufacturing 
ambitions. SEU has also engaged in coop- 
eration and exchange programs with more 
than 150 overseas universities and research 
institutions. 

“Strive for Perfection” is our century-old 
motto and value. “Becoming a world-re- 
nowned university for the sciences and 
dedicating itself to the country with talents” 
is our school-running philosophy. Attracting 
and gathering the best teachers, cultivating 
the leading talents with a strong sense of na- 
tional pride and global vision for the future 
and the benefit of mankind, innovating the 
best scientific and technological achieve- 
ments, contributing wisdom and strength to 
the social development and human progress 
are not only SEU’s spirit and soul, but also 
our timeless creed and pursuit. 


online @sciencecareers.org 
Opportunities to shine at 
ShanghaiTech University 


ShanghaiTech University is a young and dynamic higher education institution aiming for high-quality research and global 
influence. To address challenges faced by China and the world, it seeks innovative solutions in energy, materials, environment, 
human health, data science, artificial intelligence (AI), and electrical engineering. An integral part of the Zhangjiang Compre- 
hensive National Science Center, the university is now leading several frontier research projects at large-scale facilities. For 
more information, please visit: www.shanghaitech.edu.cn. 


We are now seeking talented researchers for multiple faculty positions at all ranks in the following fields: 

School of Physical Science and Technology: energy, system materials, photon and condensed state, material biology, environmental science and engineering 

School of Life Science and Technology: molecular and cell biology, structural biology, neuroscience, immunology, stem cells and regenerative medicine, system biology and biological data, molecular imaging, biomedical engineering 

School of Information Science and Technology: computer science, electrical engineering, information engineering, artificial intelligence, network and communication, virtual reality, statistics, big data and data mining 

School of Entrepreneurship and Management: economics, finance, accounting, management, marketing, strategy and entrepreneurship 

School of Creativity and Art: film production, life drawing, photography, VR and game coding, illustration & visualization, performing arts 

Shanghai Institute for Advanced Immunochemical Studies: antibody therapy, immunotherapy, cell therapy, regeneration medicine 

iHuman Institute: bio-imaging, biology, chemistry, computational biology, AI/ML 

Institute of Mathematical Sciences: pure mathematics, theory of computing, applied mathematics 


Successful applicants will 
have a doctoral degree, and 
are expected to establish a re- 
cord for independent, interna- 
tionally recognized research, 
supervise students and teach 
high-quality courses. 


ShanghaiTech University will offer attractive compensation packages, including:

Initial research support package: reasonable start-up funds, research associates and post-doctoral fellows, laboratory space to meet research needs

Compensation and benefits: highly competitive salary commensurate with experience and academic accomplishments, a comprehensive benefit package

Subsidized housing: on-campus, 80/100/120 m² faculty apartments available at low rent for tenure and tenure-track faculty

Relocation & travel allowance: reimbursement of expenses for household relocation and family’s one-way travel

Family assistance: support with children’s education; affiliated kindergarten, primary and middle schools are under construction


To apply: using this format, please submit a cover letter (Firstname_Lastname_Cover_Letter.pdf), a research plan (Firstname_Lastname_Research_Plan.pdf), and a CV (Firstname_Lastname_CV.pdf) to shanghaitechuniversity@gmail.com.
Shanghai University of Engineering Science
Talents Recruitment 2018 


I. About the University 


Shanghai University of Engineering Science (SUES) is a full- 
time regular institution of higher education with cross-discipli- 
nary and coordinated study in Engineering Science, Economic 
Administration, and Art & Design. Located in the college town of 
Songjiang, Shanghai, the 250-acre university offers undergraduate 
programs in 75 academic and professional majors, postgraduate 
programs in 4 first-tier and 22 second-tier disciplines, as well 
as 3 professional degree programs; in addition, an international
cultivation base is authorized to confer doctoral diplomas. Some
20,000 students take full-time courses at the university (nearly
2,500 of them postgraduates), and 1,200 staff contribute to the
teaching profession.

As one of the pilot programs of the training of outstanding
engineers, which are directed by the Ministry of Education of
People’s Republic of China, disciplines and majors in Shanghai
University of Engineering Science keep close to the modern
industries in Shanghai, and have exhibited highly characteristic
features.

With its continuing expansion in cooperation and exchange
with international institutions, SUES has so far conducted
cooperation and exchange on an extensive scale with several dozen
universities in countries such as America, Britain, and France.

SUES now sincerely welcomes overseas talents who are
interested in higher education to join us.


II. Job Profile:


1) Crew members for talents programs in
Shanghai University of Engineering Science:
Zhihong Scholars, Tengfei Program, Zhanchi
Program.

2) Teaching and researching faculty

3) Outstanding fresh graduates with a doctorate.


III. Application Channels


(Either works)

1) You can submit your application online
through our Talents Recruitment System
(http://zhaopin.sues.edu.cn).

2) Send your application materials and job
objectives to us at jsk@sues.edu.cn.


IV. Contact Information


Address: 333 Longteng Road, Songjiang
District, Shanghai

Contact Person: Ms. Miao, Ms. Zhu
Numbers: (+86) 021-67791252


V. Disciplines (Including but not limited to)


Science, Economics (Economics and Trade, Finance, etc.),
Management Science (Business Administration, Management Science
and Engineering, Logistics Management and Engineering, Industry
Engineering, Tourism Administration, Public Administration, etc.),
Literature (Journalism and Communication, Foreign Languages and
Literatures, etc.), Art (Art, Design, etc.), Engineering (Mechanical
Engineering, Power and Energy, Control Science and Engineering,
Mechanical Engineering, Computer Sciences, Automation,
Electrical/Electronic Engineering, Materials Science, Chemical and
Pharmaceutical Engineering, Environmental Science and Engineering,
Traffic and Transportation, Vehicle Engineering, Textile Engineering,
Fashion Design and Engineering, Aeronautical Engineering, etc.)

Disciplines related to Mechanical Engineering are mainly
recruited for the School of Automotive Engineering. The school
now has 6 undergraduate departments, 1 first-tier postgraduate
program in Mechanical Engineering, and 5 second-tier postgraduate
programs. The teaching staff is rationally coordinated in age,
knowledge, and disciplines. All members prove academic
competence, 20 of them professors, 50 associate professors, 3 doctoral
students’ supervisors, and 99 lecturers with doctoral diplomas.

Over the past 3 years, the school has undertaken 163 scientific
research projects, 30 of which are at the national level, 10 at the
provincial level, and 114 industry-based. At present, 17 laboratories
such as the Associate Center of Modern Equipment and Its
Control Technologies have been established, covering an area of
3 acres, and many internship programs have been developed with
more than 100 institutions. The school also lays emphasis on its
openness to the outside world. A close relationship with 7 scientific
research institutes such as the University of Pennsylvania has been
built, and in 2017, the school and the University of Pennsylvania
reached an agreement on a “2+2” joint education program.

To actively promote the construction plan of its first-class
discipline of Mechanical Engineering, the School of Automotive
Engineering is now recruiting talents from all over the world.


I. PROFILE OF JXUST


Founded in 1958, Jiangxi University of Science
and Technology (hereinafter JXUST) is
jointly-sponsored by the Ministry of Education, the 
Ministry of Industry and Information Technology 
and Jiangxi Provincial Government. As one of 
the top-ranking universities in Jiangxi Province, 
JXUST offers a full range of undergraduate majors, 
plus master's and Ph.D. programs. It serves as an 
important base for the education and research of 
non-ferrous metallurgy industry in China. With an 
outstanding faculty trained globally, it is making 
great strides toward its goal of building a nationally 
first-class institution of higher education and cutting- 
edge research with broad societal impacts. 


II. A QUICK INTRODUCTION OF THE FORUM 


For the first time in its history, JXUST will host
a program for younger scholars, defined as those 
scholars with no more than ten years of tenure-track 
faculty experience. This includes graduate students 
as well as post-doctoral fellows, lecturers and 
visiting affiliates who have yet to secure a continuing 
faculty appointment. 

In addition to serving as a platform for discussion 
and thought, the forum is designed to promote 
collaborative intellectual and research activity among 
researchers from different institutions and from a 
range of disciplines in science and engineering. This 
forum will offer young scholars the opportunity to 
present their work to their peers and to distinguished 
scholars from around the world. 


III. ACADEMIC DISCIPLINES


Academic disciplines involved in this Forum are as
follows: Mining Engineering, Metallurgical
Engineering, Material Science and Engineering,
Chemical Engineering, Environmental Engineering,
Civil Engineering, Mechanical Engineering,
Electrical Engineering, Control Engineering,
Information Technology, Computer Science and
Engineering, Intelligent Manufacturing, New
Material, New Energy, Economics, Management,
Finance, Mathematics, Physics, Chemistry, etc.


IV. QUALIFICATIONS FOR APPLICANTS 


Applicants for this forum, who shall be under the age
of 40, are expected to be interested in joining the
faculty of JXUST.

In addition to having 2 years or more of post- 
doctoral research or work experience, applicants, 
who are expected to have a PhD degree in a relevant 
science and engineering field, must obey Chinese 
laws and have a proven track record of high- 
quality peer-reviewed academic publications. They 
are also encouraged to apply for the four national
talents programs (“1000 Talents Program for Young
Professionals”, “Young Top Talents of the National
High-Level Talents Support Plan”, “Young Chang
Jiang Scholars Program” and “The National Science
Fund for Distinguished Young Scholars”), sponsored
by the Central Government of China through JXUST.


V. EXPENSES REFUND


The board and lodging expenses of the participants 
during the forum will be covered by the host. In 
addition, JXUST will finance participants’ travel 
subsidies, including round-trip international airfare 
(Economy class seat), and transportation expenses 
within Chinese territory. 

The travel subsidy shall be no higher than RMB
15,000 per person for scholars from the European and
American areas, no higher than RMB 7,000 per
person for scholars from the Asia-Pacific region and no
higher than RMB 5,000 per person for scholars from
China (including Hong Kong, Macau and Taiwan). In
addition, JXUST will arrange free pick-up services
at Ganzhou airport and railway station during the
forum.


VI. SCHEDULE TIMELINE 


1. Application Deadline: Dec. 10th, 2018. 

2. Invitation Period: Sep. 25th — Dec. 10th, 2018. 

3. Registration Date: Dec. 26th, 2018. 

4. Forum duration: Dec. 27th-28th, 2018.

Forum venue and airport (railway station) pick-up 
arrangement will be specified in the invitation letter. 


VII. SALARIES & BENEFITS 


JXUST invites applications for tenure-track or 
tenured faculty positions at all ranks in all major 
science and engineering disciplines. Candidates with 
research interests in all related areas are encouraged 
to apply. There are extraordinary opportunities to 
develop major research and education programs 
with collaborations with other academic/industrial 
organizations nationwide and worldwide. 

JXUST, in line with nationally first-class research 
universities, provides globally competitive 
compensation and benefit packages to the elite 
professionals. 


1. Successful applicants of the four above-mentioned
national talents programs


Candidates will be appointed to the faculty of JXUST 
at a level commensurate with each applicant’s 
background and experience. JXUST offers a 
generous salary and startup package, including: 

(1) Position & Salary: the successful candidate will 
be appointed as Chair Professor, with an annual 
salary of RMB 600,000 to 800,000 Yuan. 

(2) Research Funding: a start-up fund of no less 
than RMB 3 million for science and engineering 
disciplines, and RMB 0.5 million for disciplines 
of humanities and social sciences (national and 
provincial level research funding excluded); 
platform construction funding will be considered by
evaluation on actual conditions and needs.

(3) Housing Subsidy: on-campus transitional
housing with an area of approximately 100 m² and a
housing subsidy of RMB 1.5 million (governmental 
award or subsidy excluded) provided in the period 
of employment. 

(4) Living subsidy for accompanying spouse: a job 
offer to the candidate’s spouse based on his/her 
qualifications, or a monthly allowance of RMB 2,000 
for a duration of 6 years. 

(5) Children’s school admittance: the children of 
successful applicants are entitled to choose quality 
primary and secondary schools in line with local 
policies. 

(6) Other supporting benefits: In addition to sound
scientific research conditions, JXUST will provide
research assistants on the basis of actual needs.


2. Overseas-educated PhD degree holders 


For excellent PhDs with at least two-year 
overseas education background or with doctorate 
degree of prestigious overseas universities, the 
benefit packages include: 

1) The successful candidate will be appointed as associate
professor and is entitled to the corresponding salary
and benefits, or appointed in accordance with the tenure-track
system (no less than RMB 250,000 per year);

2) a living allowance of no less than RMB 250,000,
and on-campus transitional housing or a housing
subsidy of no less than RMB 150,000;

3) start-up research funding of RMB 200,000 for
scholars of science and engineering disciplines and
RMB 100,000 for scholars of humanities and social
sciences;

4) a spouse's allowance of RMB 100,000 or a job
offer based on the spouse's qualifications;

5) assistance in children’s school enrolment.

The above benefits are negotiable for very outstanding
applicants.

The above benefits are negotiable based on the
qualifications of applicants. JXUST is committed to
offering selected applicants competitive salaries
and benefits no lower than the standard of its kind in
Jiangxi Province.


VIII. APPLICATION PROCEDURES


Fill in the Application Form online
(Website: http://zpjob.acabridge.cn/frm/forum/join?f_id=17),
or download the Application Form,
fill it in and send it to the e-mail address below.


IX. CONTACT INFORMATION


Tel: 0086-797-8312591
Mobile: 0086-15297767555, 
0086-19979706787 

Email: jxlgrczp@163.com,
justrsc@vip.163.com


THE UNIVERSITY OF TEXAS
MD Anderson Cancer Center
Making Cancer History

OPEN RANK, TERM TENURE-TRACK / TERM TENURE
DEPARTMENT OF ENDOCRINE NEOPLASIA AND HORMONAL DISORDERS


The Department of Endocrine Neoplasia and Hormonal Disorders at The University of Texas 
MD Anderson Cancer Center seeks an individual of outstanding research capability in basic or 
translational sciences related to malignancies of the endocrine system, or more general endocrine 
disorders in the context of cancer. He/she will coordinate with existing departmental faculty research 
programs, with particular emphasis on endocrine malignancies (particularly thyroid and adrenal) and 
bone, reproductive, and metabolic abnormalities in cancer. The successful applicant will be expected 
to establish a competitively funded research program. Graduate teaching opportunities are available, 
a strong commitment to mentoring is encouraged, and performance of departmental and institutional 
service is expected. Applicants must be committed to working in a highly collaborative, multidisciplinary 
environment. Generous start-up packages are available for up to five years. 


MD Anderson, ranked the #1 cancer center by U.S. News & World Report, is the world’s largest treatment 
facility for oncologic diseases. Located within the Texas Medical Center campus in Houston, our location 
provides access to a world-renowned medical community and the splendid cultural and recreational 
diversity of a sophisticated, metropolitan area that is the country’s fourth largest city. 


Qualifications include a Ph.D. or equivalent degree in a relevant basic science field. Post-doctoral experience 
is a must, and evidence of research excellence and extramural funding is expected appropriate to rank. 


Interested applicants should send letter of intent, curriculum vitae, contact information for three letters of 
reference, and research plan to: 


Marie-Claude Hofmann, Ph.D., Professor

The University of Texas MD Anderson Cancer Center 
Unit 1461, PO Box 301402, Houston, Texas 77230-3722 
Email: mhofmann@mdanderson.org 


MD Anderson Cancer Center is an equal opportunity employer and does not discriminate on the basis of race, color, religion, age, 
national origin, sex, sexual orientation, gender identity/expression, disability, veteran status, genetic information, or any other basis 
protected by federal, state, or local laws, unless such distinction is required by law. All positions at The University of Texas MD 
Anderson Cancer Center are security sensitive and subject to examination of criminal history record information. Smoke-free and
drug-free environment.


UCONN HEALTH


DEPARTMENT OF IMMUNOLOGY 
UNIVERSITY OF CONNECTICUT SCHOOL OF MEDICINE 


The Department of Immunology at the University of Connecticut, School of Medicine, 
seeks an outstanding investigator for a tenure-track position at the Assistant or Associate 
Professor rank to establish an extramurally funded laboratory. We are searching for faculty 
candidates in all areas of Immunology including cellular and molecular immunology in 
various fields of infectious and inflammatory diseases, cancer, autoimmunity, vaccines, 
metabolism and others. Prospective candidates should bring innovative ideas and cutting 
edge technology to an already vibrant immunology community consisting of expertise 
in both adaptive and innate immunity. The ideal candidate will participate in graduate 
student training, and have access to a growing translational research community and an 
expanding scientific environment in the capital region. Salary and start-up funds are highly 
competitive and outstanding core facilities are available. Applicants must have a Ph.D. and/ 
or M.D. with several years of postdoctoral training and a high impact publication record. 
For Associate Professor level, a history of extramural funding is expected. In addition to 
the beauty of the picturesque New England countryside, the Hartford area offers a lively 
art and cultural scene and an exceptional outdoor sports environment. 


In a single PDF file, please submit a curriculum vitae, a maximum two-page summary of research
accomplishments and interests, and the names and contact information of three references
through the UConn Health Employment Services website, https://jobs.uchc.edu. Search
no. 2019-260. Please address questions to the search committee chair, c/o Ms. Kimberly
Young (Email: immunology@uchc.edu). For further information on UCH, please visit
https://health.uconn.edu/immunology. The deadline to submit applications is January 31, 2019.


UConn Health is an Affirmative Action and Equal Employment Opportunity employer, 
who encourages Males, Females, Veterans, Minorities and Persons with Disabilities to apply. 


myIDP: A career plan
customized for you, by you.


There’s only one Science.


Recommended by
leading professional
societies and the NIH


Features in myIDP include:


• Exercises to help you examine
your skills, interests, and values.

• A list of 20 scientific career paths
with a prediction of which ones
best fit your skills and interests.

• A tool for setting strategic goals
for the coming year, with optional
reminders to keep you on track.

• Articles and resources to guide
you through the process.

• Options to save materials online
and print them for further review
and discussion.

• A certificate of completion for
users that finish myIDP
and more.


Start planning today!
myIDP.sciencecareers.org
University of California
San Francisco


Join Our Leadership, Clinical, and Research Faculty Teams


The University of New Mexico Comprehensive Cancer Center (UNMCCC) is the Official Cancer Center of New Mexico and 
the only National Cancer Institute (NCI) designated comprehensive cancer center in a 500-mile radius. Our 134 oncology 
physicians, 122 cancer research scientists, and staff focus on discovering the causes and cures for cancers disproportionately 
affecting the people of the American Southwest — primarily Hispanic, American Indian, and Non-Hispanic White — with 
strikingly different patterns of cancer incidence, mortality and disparity. In the past year, our center cared for 12,000 
patients; 12 percent participated in therapeutic interventional studies and 35 percent in interventional studies. UNMCCC 
has outstanding programs in Cancer Control and Cancer Health Disparities; Cancer Genetics, Epigenetics, and Genomics; 
Cancer Cell and Systems Biology; and Cancer Therapeutics. Our research houses national centers: The Molecular Discovery 
and High Throughput Target Screening Center (nmmlisc.health.unm.edu), one of six Chemical Biology Consortium Centers 
of Excellence in The NCI NExT Program; Spatiotemporal Modeling of Cell Signaling (stmc.unm.edu), one of 13 NIH National 
Centers for Systems Biology; and a NIH Clinical and Translational Sciences Center. We enrich our endeavors by collaborating 
with Sandia and Los Alamos National Labs and Lovelace Respiratory Research Institute. Benefit from our Shared Resources 
including biospecimen collection and tissue analysis, genomics, biostatistics, bioinformatics, population science and behavioral 
interventions, and the conduct of clinical interventions. UNMCCC is the center of our statewide cancer clinical trials and health 
delivery research network — partly funded by a NCI NCORP Grant — and is an Oncology Research Information Exchange 
Network (ORIENcancer.org) member. Our center has conducted 60+ statewide community-based cancer education, 
prevention, screening, and behavioral intervention studies involving more than 10,000 New Mexicans. Visit cancer.unm.edu. 


Cancer Cell Signaling & Systems Biology (Tumor Microenvironment)
Seeking cancer cell biology, signaling, and systems biology experts with interests in dissecting mechanisms of perturbed signaling in cancer cells, analysis and modeling of pathways mediating therapeutic response or resistance, and analysis of cellular and signaling interactions and the immune response in the tumor microenvironment. Search chairs: Diane Lidke and Eric Prossnitz

Target & Drug Discovery
Seeking scientists and physician scientists focused on discovery and development of cancer diagnostic, therapeutic, and imaging agents in a therapeutics pipeline using innovative flow cytometric and other high throughput functional screening methods and chemo-informatics platforms for drug discovery. Search chairs: Larry Sklar and Alan Tomkinson

Epigenetics & Functional Genomics
Seeking experts in fundamental mechanisms of chromatin regulation and epigenetics in cancer model systems and human tissues, with interests in defining epigenetic signatures in model systems and population cohorts in response to environmental carcinogens prevalent in the American Southwest. Search chairs: Alan Tomkinson and Scott Ness

Biostatisticians
Two Positions: Associate/Full Professor • Assistant Professor
Seeking PhD biostatisticians to join an outstanding team engaged in statistical methodology relevant to cancer and in biostatistical applications integrated with basic, translational, clinical, and population science research. Search chairs: Linda Cook and Shane Pankratz

Health Services & Behavioral Intervention
Two Positions: Associate/Full Professor • Assistant Professor
Seeking faculty with scholarly achievements in health services, cancer care delivery, or behavioral intervention research, with interest in the minority populations of the American Southwest. Search chairs: Linda Cook and Larissa Myaskovsky

Cancer Molecular & Genetic Epidemiology
Two Positions: Associate/Full Professor • Assistant Professor
Seeking scientists in cancer population and molecular genetic and/or epigenetic epidemiology, particularly as it is used to assess/impact cancer health disparities, gene-environment interactions, and genetic ancestry and genetic risk assessment in multi-ethnic populations. Search chairs: Marianne Berwick and Linda Cook

Cancer Autophagy
Seeking funded scientists studying the regulation and roles of autophagy related to cancer biology with an interest in basic and translational research. Looking for applicants researching the intersection of autophagy and cancer, in areas including microenvironmental and oxidative stress, tumor cell growth and aggressiveness, and mitochondrial function. Search chairs: Eric Prossnitz and Vojo Deretic

Cancer Immunology
Seeking established mid-career or senior scientists and physician-scientists studying cancer immunology and/or the tumor microenvironment. Looking for scientific accomplishments exemplified by peer-reviewed funding and collaborative research in cancer immunology, immunotherapy, and/or the tumor microenvironment. Search chairs: Eric Prossnitz and Sarah Adams


For details and to apply, visit cancer.unm.edu/JoinTheBest 
Questions? Contact Search Coordinator Amanda Leigh at ALeigh@salud.unm.edu, (505) 272-2201. 


UNM is an Equal Opportunity/Affirmative Action Employer and Educator 




Endowed Chairs and Professorships, significant resources, leadership roles, 
and comprehensive start-up packages available. 


Photo: Bill Tondreau, “River Edge”, panoramic photographic, courtesy of sumnerdene.com
Smithsonian Institution


Research Zoologist 
Department of Invertebrate Zoology 
National Museum of Natural History 

Smithsonian Institution 


The Smithsonian’s National Museum of Natural History seeks a 
zoologist to conduct an integrative, specimen- or collection-based 
research program in invertebrate evolution and biodiversity (exclusive 
of hexapods, myriapods, and arachnids). The successful candidate is 
expected to develop an internationally recognized research program that 
makes important contributions to understanding invertebrate evolution 
and biodiversity through synthetic research involving phylogenetics, 
genetics, anatomy, development, genomics, biogeography, conservation, 
informatics, or related fields. Frequent publication of highly regarded 
papers in competitive, peer-reviewed journals, curation of collections in 
specialty area, service to the scientific community in leadership capacities, 
acquisition of external funding, engagement in outreach activities, and 
mentorship of students are expected. 


Full-time, permanent appointment with full Government benefits to be 
filled at the GS-12 level; US citizenship and a one-year probationary 
period are required. The museum’s authorized salary range for this 
position at this time is $81,548 - $86,984 per year. College transcripts 
and proof of U.S. accreditation for foreign study must be submitted 
online by the closing date of announcement. For complete requirements 
and application procedures go to www.sihr.si.edu or www.usajobs.gov
and refer to Announcement 19A-JW-304220-DEU-NMNH. The
announcement opens November 7, 2018. Applications and all supporting 
documentation must be received on-line by January 7, 2019 and must 
reference the announcement number. All applicants will be notified by 
email when their application is received. 


The Smithsonian Institution is an Equal Opportunity Employer.


Smithsonian Institution


Research Zoologist 
Department of Vertebrate Zoology 
National Museum of Natural History 
Smithsonian Institution 


The Smithsonian’s National Museum of Natural History seeks a zoologist 
to conduct an integrative specimen- or other collection-based research 
program in vertebrate evolution and biodiversity, in the disciplines of 
herpetology, ichthyology, mammalogy, and/or ornithology, especially 
mammalogy. The successful candidate is expected to develop an 
internationally recognized research program that makes important 
contributions to understanding vertebrate evolution and biodiversity 
through integrative research involving phylogenetics, anatomy, 
development, genomics, biogeography, conservation, informatics, 
or related fields. Frequent publication of highly regarded papers in 
competitive, peer-reviewed journals, curation of collections in specialty 
area, service to the scientific community in leadership capacities, 
acquisition of external funding, engagement in outreach activities, and 
mentorship of students are expected. 


Full-time, permanent appointment with full Government benefits to be 
filled at the GS-12 level; US citizenship and a one-year probationary period 
are required. The museum’s authorized salary range for this position at this 
time is $81,548 - $86,984 per year. College transcripts and proof of U.S. 
accreditation for foreign study must be submitted online by the closing date 
of announcement or your application will be disqualified. For complete 
requirements and application procedures go to www.sihr.si.edu or 
www.usajobs.gov and refer to Announcement 19A-JW-304235-DEU-NMNH.
The announcement opens November 1, 2018. Applications and
all supporting documentation must be received on-line by December 
13, 2018 and must reference the announcement number. All applicants 
will be notified by email when their application is received. 


The Smithsonian Institution is an Equal Opportunity Employer. 


IOWA STATE UNIVERSITY 
Assistant Professor in Systems Biology 


Department of Genetics, Development & Cell Biology 


The Department of Genetics, Development and Cell Biology (GDCB) 
at Iowa State University (ISU) (https://www.gdcb.iastate.edu) 
invites applications for a tenure-track assistant professor position 
in Systems Biology. 


GDCB seeks to hire a systems biologist who addresses fundamental 
questions at a cellular or molecular level, using a combination of 
experimental (e.g., genomics, transcriptomics, proteomics, metabolomics, 
high-content microscopy) and computational (network analysis, 
modeling) approaches to address fundamental aspects of complex cellular 
or organismal functions. Scientists working in any model organism or 
across diverse species are encouraged to apply. Areas of interest include 
but are not limited to: cellular and developmental processes integral to 
plant or animal health or disease, phenomics, genetic and metabolic 
regulatory networks, and responses to environmental signals and stresses. 
Interdisciplinary or collaborative research that complements existing 
strengths of the department and ISU is encouraged. 


Responsibilities include building a nationally recognized research
program that competes successfully for extramural funding, advancing
the discipline through high-quality publications, mentoring students, 
and effective teaching of undergraduate and graduate courses. The 
successful candidate will demonstrate excellent communication and 
leadership skills and will share the university’s commitment to an 
inclusive environment that supports diversity. 


Required qualifications include a Ph.D. in life sciences or relevant 
discipline and published record of high-quality research. 


To view the full job description and apply electronically, please visit
https://www.iastatejobs.com/postings/36729. For full consideration,
submit the application by December 7, 2018.


Iowa State University is an EO/AA Employer. All qualified applicants 
will receive consideration for employment without regard to race, color, 
religion, sex, national origin, disability, or protected veteran status. 


Search more 
jobs online 


Access hundreds of job postings 
on ScienceCareers.org. 


Expand your search today. 
HOWARD HUGHES MEDICAL INSTITUTE 


2019 Hanna H. Gray Fellows Program 


The Howard Hughes Medical Institute (HHMI) is pleased to announce the 2019 Hanna H. Gray Fellows Program competition. This 
program seeks to increase diversity in biomedical science by recruiting and retaining individuals from groups underrepresented 
in the life sciences. Through their successful careers, HHMI Hanna Gray Fellows will inspire future generations of scientists from 
America’s diverse talent pool. 


The competition opens September 12, 2018. 


Fellows will receive funding for their postdoctoral training and may continue to receive funding during their early career years as 
independent faculty. The program includes opportunities for career development, including mentoring and active involvement in 
the HHMI scientific community. The Institute will select and support up to 15 fellows in this competition. 


This grant competition is open to all eligible applicants and no nomination is required. 


Eligibility: 
The program is open to individuals who: 

* are from gender, racial, ethnic, and other groups underrepresented in the life sciences, including those individuals from 
  disadvantaged backgrounds. 
* hold a PhD and/or MD (or equivalent), which must be conferred by the start of the grant term. 
* U.S. citizens must have a degree from a research institution in the U.S. (including Puerto Rico) or an international research 
  institution. 
* Non-U.S. citizens and applicants with other nationalities must have a degree from a research institution in the U.S. (including 
  Puerto Rico). 
* have no more than 16 months of postdoctoral research experience at the time of the application due date. 


Applications are due on January 9, 2019. Mentor and reference letters must be received by January 16, 2019. 
Further details: https://www.hhmi.org/programs/hanna-h-gray-fellows-program or contact fellows@hhmi.org. 


Duke University Medical Center 


ENDOWED CHAIR POSITIONS FOR RESEARCHERS IN THE 
DEPARTMENT OF PATHOLOGY 


Applications are invited for appointment as Assistant Professor, Associate 
Professor or Professor in the Department of Pathology in the Duke University 
School of Medicine to fill recently created endowed chair positions. We 
seek individuals with an innovative research program in any area related 
to Cancer, Immunology or Biotechnology/Bioengineering intersecting with 
disease research. 


Applicants at the Assistant Professor rank are expected to have a publication 
and training history that will lead to sustainable external research funding. 
Applicants at the Associate Professor rank are expected to have earned extensive 
external research funding while applicants at the Professor rank are expected to 
have demonstrated the ability to maintain extensive external research funding. 


New faculty members will be supported with competitive start-up funding 
packages, research laboratory space, access to research animal facilities, and a 
wide range of state-of-the-art core research facilities 
(https://medschool.duke.edu/research/shared-resources/core-research-facilities). 
New faculty members are expected to participate in the education and training 
of medical and graduate students and postdoctoral and clinical fellows. 


Information about the Department of Pathology at Duke may be found here: 
https://pathology.duke.edu 


Applicants should submit a curriculum vitae, a brief statement of past research 
accomplishments and future research interests, and arrange to have three 
letters of recommendation submitted via 
https://academicjobsonline.org/ajo/jobs/12743. 


Complete applications (including letters of reference) submitted by March 1, 
2019 will receive full consideration. 


Duke University is an Affirmative Action/Equal Opportunity Employer. 


UPMC | HILLMAN 
CANCER CENTER 
Affiliated with the University of Pittsburgh School of Medicine 


Assistant/Associate/Full Professor at UPMC Hillman Cancer 
Center and the University of Pittsburgh Department of 
Computational and Systems Biology 


We invite applications for new tenure-track or tenured Faculty positions at 
the Assistant Professor, Associate Professor, and in exceptional cases 
Full Professor level, at UPMC Hillman Cancer Center (Hillman) and in the 
Department of Computational and Systems Biology (DCSB) at the University 
of Pittsburgh School of Medicine. We seek creative individuals who can develop 
a cutting-edge research program and lead collaborative research initiatives 
on the development and use of novel computational models, methodologies, 
machine learning algorithms and quantitative analysis and dissemination tools 
to address current challenges in cancer research and to facilitate the translation 
of basic research into clinical applications. DCSB and 
Hillman provide an integrative and supportive research environment to engage 
in interdisciplinary science with ample space and facilities for both dry- and 
wet-lab work. Research focus areas of interest include, but are not limited to, 
single cell transcriptomics/genomics, cancer heterogeneity, cancer phylogeny; 
microbiome and its effect on immune regulation; dynamic systems modeling 
in cancer; 4D genome architecture and dynamics; anticancer drug discovery 
and systems pharmacology. 

Level of appointment and salary will be commensurate with qualifications, 
experience and responsibilities. The successful candidate will have the 
opportunity to mentor students at the Joint Carnegie Mellon University— 
University of Pittsburgh Ph.D. Program in Computational Biology, as well 
as those in the recently founded Integrative Systems Biology PhD program. 
Applicants are kindly requested to submit to Ms. Lola Thompson 
(thompsonla3@upmc.edu) a cover letter, curriculum vitae, list of three 
referees, and research statement via email. The review of applications will 
begin on December 1, 2018 and will continue until the position(s) are filled. 

The University of Pittsburgh is an Affirmative Action, Equal Opportunity 
Employer. EEO/AA/M/F/Vets/Disabled. 
PRINCETON UNIVERSITY | Princeton Environmental Institute 


Senior Faculty Search in Carbon Mitigation 


Princeton Environmental Institute, the interdisciplinary center of 
environmental research, education and outreach at Princeton University, 
seeks distinguished candidates for a senior appointment (tenured 
level) in any field relating to climate change and carbon mitigation. 
We seek an individual with a demonstrated record of excellence in 
disciplinary scholarship and teaching, with complementary interests 
in interdisciplinary research and an ability to communicate to, and 
work with, entities outside the academy, including governments, 
nongovernmental organizations, industry partners, and the broader 
public. We are particularly interested in candidates who can take 
an active leadership role in Princeton’s Carbon Mitigation Initiative 
(https://cmi.princeton.edu). The successful candidate will be jointly 
appointed in the department best suited to her or his research areas and 
in the Princeton Environmental Institute. 


Applicants should apply online at 
https://www.princeton.edu/acad-positions/position/9661 
and should submit a letter of interest along 
with a curriculum vitae, research statement, teaching statement, and the names 
and contact information of three potential referees. The letter of interest 
should include the candidate’s vision of the field and identify major 
unanswered questions of interest to them. Evaluation of applicants 
will begin on January 8, 2019 and continue until the position is filled. 


We seek faculty members who will create a climate that embraces 
excellence and diversity, with a strong commitment to teaching and 
mentoring that will enhance the work of the institute and attract and 
retain a diverse student body. 




Wake Forest School of Medicine 
Department of Neurobiology and Anatomy 


NEUROSCIENCE POSITION 


The Department of Neurobiology and Anatomy is initiating a search 
for a tenure-track faculty member working in SENSORY/MOTOR 
NEUROSCIENCE. The selected candidate will interact with a large and 
collaborative group of sensory, cognitive and motor systems neuroscientists 
with funded research programs spanning the visual, auditory, somatosensory, 
and motor systems, as well as training grants emphasizing integrative 
function and development. In addition to maintaining a robust internationally 
recognized research program, successful candidates will be expected to 
support institutional and departmental teaching and mentoring for graduate 
students, medical students, and junior faculty. Start-up package, laboratory 
space, salary and rank will be commensurate with experience. Information 
on the department may be viewed at: 

https://school.wakehealth.edu/departments/Neurobiology-and-Anatomy 


Wake Forest School of Medicine offers a dynamic environment for 
cutting-edge multidisciplinary research. Collaborative opportunities with other 
research entities include the WFU Primate Facility; Alzheimer’s Disease 
Center; NCI-funded Comprehensive Cancer Center; Wake Forest Institute 
for Regenerative Medicine (WFIRM); Sticht Center on Aging; and others. 
Wake Forest is an Affirmative Action/Equal Opportunity Employer. 


Winston-Salem is a vibrant community with an exciting cultural environment 
that is perennially ranked among the best places to live in the U.S. Situated 
close to both the Blue Ridge Mountains and the coast, 
Winston-Salem offers residents a wide array of recreational 
activities. Applicants should submit an application package as a single PDF 
consisting of their curriculum vitae, a brief description of current and future 
research interests, and contact information for three references to: 
Lindsay Teague, Talent Acquisition, Wake Forest Baptist Medical 
Center at: Lindsay.Teague@wakehealth.edu or (336) 716-8393. 
SANOFI - INSTITUT PASTEUR 2018 AWARDS 7TH EDITION 


For this new edition, the Sanofi - Institut Pasteur Awards foster 
scientific excellence for Global Health. 


2018 AWARDS FOR BIOMEDICAL RESEARCH 


INTERNATIONAL SENIOR AWARD 

Dr. Claude-Agnès REYNAUD and Dr. Jean-Claude WEILL 
Institut Necker-Enfants Malades, Paris, France 

Claude-Agnès Reynaud and Jean-Claude Weill were awarded for 
discovering novel mechanisms by which B-cell antibody repertoires 
are generated. They are amongst the world’s leading experts on 
adaptive immunology. 


INTERNATIONAL MID-CAREER AWARD 

Dr. Maria MOTA 
Instituto de Medicina Molecular João Lobo Antunes, Lisbon 

Maria Mota was awarded for her contributions to elucidating how 
Plasmodium regulates iron levels in order to survive. Her scientific 
research is widely recognized and respected by the malaria community. 


Northeastern University 
College of Engineering 


With 178 tenured/tenure-track faculty (80 hired 
since 2013), and 13 state-of-the-art research 
centers, with funding by eight federal agencies, 
Northeastern’s College of Engineering is in a 
period of dynamic growth. Our emphasis on 
interdisciplinary, transformative and innovative 
research—tied to Northeastern’s unique history 
of industry collaboration via the university’s 
signature cooperative education program— 
enables partnerships with academic institutions, 
medical research centers, and companies 
near our centrally located Boston campus and 
around the globe. 


The college seeks outstanding faculty 
candidates in all five departments. 


Consideration will be given to candidates at the 
assistant, associate, and full professor levels; 
successful applicants will lead internationally 
recognized research programs aligned 
with one or more of the college’s strategic 
research initiatives. 


Learn more and apply at 
coe.neu.edu/faculty/positions 


Northeastern University is an Equal Opportunity, Affirmative Action 
Educational Institution and Employer, Title IX University. All qualified 
applicants will receive consideration for employment without regard 
to race, color, religion, sex, national origin, disability status, protected 
veteran status, or any other characteristic protected by the law. 
Northeastern University is an E-Verify Employer. 


Faculty Position in Bacteriology 
Department of Microbiology-Immunology 
Northwestern University Feinberg School of Medicine, Chicago 




The Department of Microbiology-Immunology at Northwestern University Feinberg School of Medicine 
seeks a full-time tenure-track Investigator at the rank of Assistant, Associate, or Full Professor within 
the broad field of bacteriology. Applicants in the fields of antibiotic resistance, bacterial pathogenesis, 
microbiome, and host-pathogen interactions are particularly encouraged to apply. 


Qualified candidates are expected to have a Ph.D., M.D., or equivalent degree as well as postdoctoral 
training. The successful candidate will be expected to establish and maintain a vigorous, extramurally 
funded research program and to participate in our strong graduate and medical student training programs. 
All applicants should have substantial peer-reviewed publications that demonstrate productivity and 
the ability to perform cutting-edge research. Candidates for an Assistant Professor position should have 
current or pending external funding, which could include an NIH K level award or equivalent. Candidates 
at the Associate Professor or Full Professor level should have substantial research productivity, current 
grant support and academic service. The primary criteria for selection will be excellence and creativity 
in research and scholarship. We offer a highly interactive collegial research environment with state-of- 
the-art research facilities. 


The start date is negotiable and the position will remain open until filled. 


Please read ALL instructions and make preparations before proceeding to the application page: 
* Applications will only be accepted via online submission (see link below). 
* Please prepare all documents in advance as Adobe PDF files, and please be sure all information 
  is entered correctly and accurately (especially names and email addresses), as there will be no 
  opportunity for online revision after your application has been submitted. 
* All required fields in the application form are marked with an asterisk and must be filled before 
  clicking the “Submit” button. 
* Be aware that incomplete applications cannot be saved. 


Applications accepted here: https://facultyrecruiting.northwestern.edu/apply/NTI= 


Northwestern University is an Equal Opportunity, Affirmative Action Employer of all protected 
classes, including veterans and individuals with disabilities. Women, racial and ethnic minorities, 
individuals with disabilities, and veterans are encouraged to apply. Hiring is contingent upon 
eligibility to work in the United States. 


WORKING LIFE 


Climbing out of the bottle 


By Elizabeth Marchio 


I opened my swollen, bleary eyes to see a young police officer crouched close to my face, pen hovering 
over a clipboard. It was the same officer who had taken my mugshot. I had woken from an 
uncomfortable sleep, my body draped across three chairs in the police station holding room, my 
arms pulled inside my shirt in a futile attempt to keep warm. I peeked at the clock: 3:13 a.m. My 
body ached so much that I couldn’t focus on his words. Methodically, he ran through the intake 
form. “Are you feeling hopeless or have nothing to look forward to?” I blinked back a new round of 
tears, slowly understanding that he was asking me whether I intended to commit suicide while I was 
held in jail. This was rock bottom. It was also the wake-up call I needed. 


Five years into a Ph.D. program 
and separated from my husband—a 
decision I had made—I was unable 
to right myself from a deep depres- 
sion and had just been arrested for 
driving while intoxicated. My hus- 
band and I had started our Ph.D. 
programs at the same time. We were 
older than our classmates, which 
increased the pressure to succeed, 
and managing the workload put a 
major strain on our relationship. 
If we allowed anything other than 
work to be our top priority, even 
temporarily, we were overcome by 
guilt. We stopped making time to 
go camping or take regular vaca- 
tions. I felt neglected and seemed to 
want more from him than he could 
provide. Although we both wanted 
to start a family at some point, it 
was not the “right time.” We would 
always have time for our relationship later, we thought. 

We were wrong. The tiny crack of disconnect widened. 
After 4 years as Ph.D. students, we went our own ways. 

I struggled to continue on my Ph.D. journey alone. My 
isolation increased as friends graduated and left the area. 
Social media reminded me of the things I hadn’t yet accom- 
plished and sometimes felt I was too old to ever realize: a 
permanent job, a home, a family of my own—the American 
dream. Healthy coping mechanisms—such as going to the 
gym, joining a sports team, or simply sitting with my in- 
tense disappointment and fear about the future—felt like 
luxuries that I could not afford. Instead, my new normal 
became drinking after a hard day—which was just about 
every day. I knew my family had a history of alcoholism that 
had blighted my mother’s childhood, but I did not think my 
mode of dealing with stress was self-destructive. 

No, that’s not true. I knew I was unhappy. I knew that 
waking up hungover with little recollection of the night 
before meant a day of work lost. 
Nevertheless, I did not care, as long 
as drinking brought momentary 
relief. I told myself that I was drinking 
as a way to cope with feeling 
left behind in life, but really, I was 
self-medicating to stop feeling anything 
at all. 

It ended with a wrecked car and 
an arrest. Fortunately, no one was 
physically harmed, but they could 
have been. After a year of court 
proceedings, I am now 4 months 
into a year of probation, which 
includes thrice-daily breathalyzer 
tests, bimonthly urinalyses, courses 
at Mothers Against Drunk Driving, 
fines, fees, and more shame than 
I ever thought possible. At first, it 
all seemed unfair. I did not think I 
had a problem, and I fought tooth 
and nail against the punishments. 

But after hearing from a mother who lost her daughter 
because of a drunk driver, I finally faced up to my part in 
everything that happened. I had made the decision to pur- 
sue a Ph.D. I had made the decision to leave my husband. 
And I had made the decision to drink and drive. Now, I 
needed to make the decision to get my life back on track. 
I got my drinking under control. Somewhat miraculously, I 
just successfully defended my Ph.D. thesis, which brings a 
mixture of pride, relief, and regret that it cost me so much. 

For others who are precariously close to the edge of 
their own self-destructive whirlpool, perhaps my story can 
serve as a cautionary tale. Life does not stop while you are 
in graduate school. Graduate school is a part of life. You 
have to take care of yourself and find healthy ways to cope. 
You have to pay attention. 


Elizabeth Marchio is finishing her Ph.D. at Texas A&M 
University in College Station. 